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(57) Abstract 

A process is disclosed for preparing a 
protein by a eukaryote transformed by multi- 
copy integration of an expression vector into 
the genome of a yeast, such as Saccharomyces* 
Hansenula, and Kluyveromyces, or of a mould 
such as Aspergillus, Rhizopus and Trichoder- 
ma, said expression vector containing both an 
"expressible gene" encoding said protein and 
a so-called "deficient selection marker needed 
for the growth of the yeast or mould in a spe- 
cific medium", such as the LEU2d, TRPld or 
URA3d gene, in combination with a ribosomal 
DNA sequence, resulting in stable high copy 
integration of 100-300 copies per celt. This 
multicopy integration results in an increased 
production of the desired protein, which can 
be guar a-galactosidase, an oxidase or a hy- 
drolytic enzyme such as a lipase. 
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Process for preparing a protein by a fungus transformed 
bv multicopy integration of an expression vector. 

In a major aspect, tlie invention relates to a process 
5 for preparing a, homologous or heterologous, protein by , 
a yeast f transformed by multicopy integration of an 
expression vector into the genome of the yeast, said 
expression vector containing both an "expressible gene" 
encoding said protein and a so-called "deficient 
10 selection marker needed for the growth of the yeast in a 
specific medium". 

Although most experiments have been carried out with 
yeasts, it is envisaged that the invention is also 
applicable to moulds. Therefore in this specification in 
15 addition of either yeast or mould the term "fungus", or 
its plural form "fungi", will be used which covers both 
yeasts and moulds. 

In this specification the expression "expressible gene" 
means a structural gene encoding a protein, either 
20 homologous or heterologous to the host organism, in 

combination with DNA sequences for proper transcription 
and translation of the structural gene, and optionally 
with secretion signal DNA sequences, which DNA sequences 
should be functional in the host euJcaryote. 

25 

In this specification the expression "deficient 
selection marker needed for the growth of the yeast or 
mould in a specific medium" is used for a marker gene 
containing a promoter and a structural gene encoding a 
30 polypeptide or protein, said polypeptide or protein 

- either being needed for the production of an ingre- 
dient, such as amino acids, vitamins and nucleo- 
tides, which ingredient is essential for the 
growth of the yeast or mould; in this specification 

35 such ingredient is also called "essential 

nutrient", 

- or being needed for the protection of the cell 
against toxic compounds, such as antibiotics or 
Cu 2+ ions, present in the medium, 

40 provided that the deficient selection marker results 

- either in sub-optimal de novo synthesis of said 
polypeptide or protein, which in turn results in a 
sub-optimal production of the essential ingredient 
or in sub-optimal protection against said the toxic 

4 5 compound , respectively , 

- or in de novo synthesis of a modification of said 
polypeptide or protein having a sub-optimal 
efficiency in the production of said essential 
ingredient or in sub-optimal protection against 

50 said toxic compound, respectively. 

Thus the word "deficient" is used to indicate both the 
sub-optimal synthesis of the polypeptide or protein, and 
the production of a polypeptide or protein having sub- 
optimal efficiency in the actions for the cell as 

55 mentioned above. 
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Examples of such marker genes include auxotrophic 
markers such, as the LEU2 , the TRP1 and the URA3 genes, 
antibiotic resistance genes such as the G418 resistance 
gene and the chloramphenicol resistance gene, and the 
5 gene encoding the enzyme catalase which can protect the 
cell against H20 2 . 

BACKGROUND OF THE MULTICOPY INTEGRATION ASPECT OF THE 
INVENTION 

10 An example of "a so-called "deficient selection marker 
needed for the growth of the yeast" is the LEU 2d gene 
described by Kingsman c.s. (reference 1), who described 
the development of a multicopy integrative vector which 
was dispersed throughout the genome using the transposa- 

15 ble Ty element: 1 Ty 1-1 5 (reference 2). The element was 
engineered ^to contain two selectable markers, TRF1 
(reference 3) and LEU2 from pMA3a, and the PGJC expres- 
sion signals from pMA91 (reference 4) with an IFN-a 2 
coding sequence (reference 5) . A single copy of the 

20 engineered Ty was integrated into the genome using a 
linear fragment to stimulate recombination across the 
ends of the element and thereby replacing an endogenous 
element. Transfonnants were selected for the TRP1 
marker. Few trans formants were obtained by selecting for 

25 LEU2 as insufficient enzyme was produced by a single 
copy of this gene. The trans formant was then grown in 
decreasing concentrations of leucine to select for an 
increase; in the copy number of the LEU 2 gene, presumably 
by spread of the Ty element throughout the genome by 

30 gene conversion and transposition (reference 6) . A 

strain was constructed which produced 8 x 10 5 molecules 
of I FN per cell; this being intermediate between yields 
from single copy ars/CEH vectors (10 5 molecules/ cell) 
and from multicopy vectors such as pMA91 (6 x 10 6 

35 molecules/ cell) ♦ 

For a practical stable production system with a 
transformed yeast the use of Ty elements has certain 
disadvantages . 

40 - For example, Ty elements are homologous to 

retroviral Sequences, wnich are more or less suspect 
materials for production of a protein suitable for 
products for human consumption or in the preparation 
thereof. Thus xt is preferable to find solutions whereby 

45 these more or less suspect materials are not used. 

- Another disadvantage is their property of being 
transposable elements. This has the consequence that an 
appreciable risk exists that the resulting strain is not 
genetically stable, because the transposable TV elements 

50 integrated in the chromosome of the yeast can transpose 
and integrate at other sites of the genome which has 
negative implications for the production process and can 
give problems fn obtaining clearance from responsible 
companies and the authorities . 

55 - In view of their retroviral properties Ty elements 
may result in virus-like particles. This is highly 
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undesirable for practical production processes, because 
instability of genetically modified organisms should be 
avoided. 

- Ty elements only occur in the yeast Saccharom/ces 
5 cerevlsxae . Therefore it is doubtful, whether they can 

be used for other yeasts or even moulds. It is unknown 
whether transposable elements occurring in other 
organisms can be used in a similar way- But even if they 
could, they have the same disadvantages as indicated 
10 above - 

- The copy number obtained with Ty integration is 
about 20-30 with a single maximum of about 40 copies per 
cell. A higher number of 100-300 copies per cell would 
be highly advantageous for commercial production 

15 systems, as higher copy numbers, in general, will result 
in higher expression levels. 

Therefore a need exists for other systems by which 
multicopy integration of heterologous genes in fungi 
20 such as yeast and moulds can be achieved. " 

SUMMARY OF THE MULTICOPY INTEGRATION ASPECT OF THE 
INVENTION 

It has now been found that stable multicopy integration 
25 in s. cerevisiae can be obtained by use of an expression 
vector containing both an expressible heterologous gene 
and a "deficient selection marker needed for the growth 
of the yeast" as above defined and additionally a 
ribosomal DNA sequence, of which the ribosomal DNA 
30 sequence enables stable multicopy integration of said 
expression vector in the ribosomal DNA locus of the 
yeast genome. Surprisingly it appeared to be possible 
with such a system to obtain multicopy integration of 
over 200 copies per cell, which were stable over more 
35 than 70 generations in both batch and continuous 
cultures . 

It has surprisingly been found that not only the known 
LEU2d system but also other "deficient markers" can be 
40 used, in particular a TKP Id or URA3& gene. 

It has further been found that this technique can also 
be applied to other yeasts, in particular of the genera 
Hansenula and Kluyveromyces * 

Thus the principle of using an expression vector 
45 containing a "deficient marker" combined with a 

ribosomal DNA sequence for obtaining multicopy integra- 
tion in a yeast as disclosed above appears to have a 
more general application, for example for other yeasts 
like Pichia or moulds e.g. belonging to the genera 
50 Aspergillus , Rhizopus or Trichoderma , in particular if 
the multicopy integration vectors contain ribosomal DNA 
originating from the host organism. Thus this principle 
is applicable for fungi in general. 

55 The multicopy integration aspect of : the present 

invention provides a process for preparing a heterolo- 
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gous protein, e.g. a lipase, by a eukaryote transformed 
by multicopy integration of an expression vector into 
the genome of the eukaryote, said expression vector 
containing both an expressible gene encoding said 
5 heterologous protein and a so-called "deficient 
selection marker needed for the growth of the 
eukaryote w , in which process said expression vector 
contains ribosomal DNA sequences enabling multicopy 
integration of said expression vector in the ribosomal 
10 DNA locus of the eukaryote genome* 

It has further been found that an expression vector as 
herein before described can be stably maintained at a 
high copy number, when a fungus transformed according to 

15 the invention is grown in a so-called "complete" or non- 
selective medium, which contains all the ingredients 
necessary for growth of the fungus. Normally one would 
expect that do novo synthesis is not required due to the 
presence of the essential ingredient in the medium, 

20 which would result in decreasing the proportion of 
multicopy-integrated yeast cells in the total yeast 
population and thus would lead to a decreased production 
of the desired polypeptide or protein. Surprisingly, 
despite a situation in which de novo synthesis is not 

25 required, the multicopy integration is stably maintained 
and the polypeptide or protein was produced in relative- 
ly large quantities. 

Although the invention is not limited by any explana- 
30 tion, it is believed that the effects observed are 

based on the following theory. For unknown reasons it 
seems that in such a system the uptake of the essential 
ingredient is limited. Therefore, de novo synthesis is 
still needed when the fungus is grown at a growth-rate 
35 above a certain minimum value. This will result in a 
selection advantage for those cells which have a high 
copy number of the "deficient marker". Possibly the 
active uptake of the essential ingredient, e.g. leucine, 
is negatively influenced by the presence of other 
40 components in the medium, such as peptides and valine. 

Thus, in general, the process can be described as a 
process in which said transformed fungus is grown in a 
medium containing said essential ingredient at a 
45 concentration below a certain limit whereby the uptake 
of said ingredient is rate-limiting, so that de novo 
synthesis of said ingredient is required for a growth- 
rate above a certain minimum value. 

50 An example of a complete medium is an industrially 
applied growth ioedium such as molasses, whey, yeast 
extract and com&inations thereof. 

Another embodiment of this invention is the fermentative 
55 production of one of the various, forms of enzymes 

described above or related hosts. Such a fermentation * 
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can either be a normal batch f ermentation, a fed -batch 
fermentation or a continuous fermentation. The selection 
of which process has to be used depends on the host 
strain and the preferred down stream process. 
5 According to this embodiment it is preferred that the 
enzyme is secreted by the microorganism into the 
fermentation broth, whereafter the enzyme can be 
recovered from the broth by first removal of the cells 
either by filtration or by centrifugation. 

10 

In a further aspect, the invention relates to enzymes 
to recombinant DNA techniques applicable for example for 
their modification and production. 

15 In particular embodiments this aspect of the invention 
relates to the production of modified enzymes or 
modified enzymes, especially modified lipases. Thus this 
aspect as described below provides inter alia techniques 
for production of lipase, e.g. lipases of the genus 

20 Pseudomonss , e.g. lipase from P . glumae (alias P. 

gladioli) and further provides genetically modified 
forms of such lipases. 



25 SPECIFIC EMBODIMENTS OF THE MULTICOPY ASPECT OF THE 
INVENTION 

More specifically the invention provides a process for 
preparing a, homologous or heterologous, protein by a 

30 eukaryote transformed by multicopy integration of an 
expression vector into the genome of a host eukaryote, 
said expression vector containing both an "expressible 
gene" as herein before defined encoding said homologous 
or heterologous protein and a so-called "deficient 

35 selection marker needed for the growth of the yeast or 
mould in a specific medium" as herein before defined, 
wherein said expression Vector contains ribosomal DNA 
sequences enabling multicopy integration of said 
expression vector in the ribosomal DNA locus of the 

40 eukaryote genome. Preferably said deficient selection 
marker is a LEU2& gene, a TRPX& gene, or a URA3& gene. 
The eukaryote can be a fungus such as a yeast, 
preferably one of the genera Saccharomyces, Jtluyvex-o- 
mjces or Hansenula , or a mould, preferably one of the 

45 genera Aspergillus, Rhizopus and Trichoderma . 

In a preferred process said transformed eukaryote is 
grown in a medium containing an ingredient, which is 
essential for the growth of the eukaryote, at a concen- 
tration whereby the uptake of said ingredient is rate- 

50 limiting, so that de novo synthesis of said ingredient 
is required for a growth-rate above a certain minimum 
value which value depends on the host organism and the 
process conditions. Preferably such medium is a so- 
called "complete" or non-selective medium, which 

55 contains all the ingredients necessary for* growth of the 
eukaryote, for example, an industrially applied growth 
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medium, such as molasses, whey, yeast extract and 
mixtures thereof. 

In order to obtain sufficient production of a selected 
protein in the process according to the invention it is 
5 preferred that the transformed eukaryote contains the 
gene or genes required for expression of said protein in 
a multimeric form in one of its chromosomes in, or 
directly linked to, a locus coding for a ribosomal RNA 
while at the same locus also multimeric copies of a 

10 deficient gene encoding a protein required in the 

biochemical pathway for the synthesis of said "essjentlaJL 
nutrient" are present. Examples of such expressible gene 
are those encoding an enzyme, preferably a hydrolytic 
enzyme, in particular a lipase, or a genetically 

15 modified form of such enzyme. Particularly preferred 
lipases that can be produced with a process according 
to the present invention are lipases that cross-react 
with antisera raised against a lipase from chromobacter 
\r Is co sum var lipolyticum NRRli B-3673, or with antisera 

20 raised against lipase from Alcallgenes PL-679, ATCC 

31371 or FERM-P 3783, or with antisera raised against a 
lipase from Pseudomonas fluorescein; I AM 1057 , and 
modified forms of such cross -reacting lipase, 
A specially preferred lipase is encoded by a gene having 

25 the nucleotide sequence given in Figure 2 or any 
nucleotide sequence encoding the same amino acid 
sequence as* specified by that nucleotide sequence or 
encoding modified forms of this amino acid sequence 
resulting in a" lipase with a better overall performance 

30 in detergents systems than the original lipase. 

The transformed eukaryote used in a process according to 
the invention is preferably a eukaryote being deficient 
for the synthesis of an "essential nutrient" as herein 
before defined and whereby the deficient selection 

35 marker can contribute to complementation of the 

synthesis of the "essential nutrient". The deficiency of 
the parent strain can be achieved by replacement of a 
gene coding for an enzyme effective in the biosynthetic 
pathway of producing said essential nutrient. It is 

40 particularly advantageous if the enzyme, for which the 
parent strain is deficient, catalyses a reaction in a 
part of the biosynthetic pathway that is not branched 
until the essential nutrient is formed. Examples of 
essential nutrients are amino acids, nucleotide or 

45 vitamins, in particular one of the amino acids leucine, 
tryptophan or uracil. 

Another embodiment of the invention is a process as 
described above, in which the expression vector contains 

(i) a ds ribosomal DNA or part thereof e.g. * ds DNA 
50 sequence that codes for a ribosomal RNA, and 

(ii) a DNA sequence containing in the 5' — > 3' direction 
in the following order: 

(ii) (a) a powerful promoter operable in the host 
organism, 
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(ii) (b) optionally a signal sequence facilitating the 
secretion of said protein from the host 
eukaryote , 

(ii) (c) a structural gene encoding the protein, 
5 (ii) (d) an efficient terminator operable in the host 
eukaryote , 

in addition to the sequences normally present in a 
vector. 

The ribosomal DNA can be ribosomal DNA's occurring in 
10 moulds, in particular moulds of the genera Aspergillus, 
Rhizopus and Trlchoderma , or those occurring in yeasts, 
in particular yeasts of the genera Saccharomyces , 
Zluyveromyces , Ea.nsenula. and Picbia* 
Experiments have shown that the best results are 
15 obtained when the vector has approximately the same 

length as one ribosomal DNA unit of the host organism. 
For example, if the ribosomal unit in the chromosomal 
DNA is about 9 kb, vectors of about 14 kb or 5 kb were 
not stably maintained, but vectors of about 8-10 kb were 
20 stably maintained. 

The promoter controlling the expressible gene is 
preferably 

(i) the Gal7 promoter, the GAPDH promoter, or the PGK 
promoter, if the host belongs to the genus 

25 Saccharomycos , 

(ii) the inulinase promoter, the PGK promoter or the 
LAC4 promoter, if the host belongs to the genus 
Ilu/veromjces , 

(iii) the DHAS promoter or MOX promoter, if the host 
30 belongs to the genus Bensenula , 

(iv) the glucoamylase promoter, glucose-oxidase promoter 
or the GAPDH promoter, if the host belongs to a 
mould of the genus Aspergillus , or 

(v) the cellulase promoter or the GAPDH promoter, if 
35 the host belongs to moulds of the genera Rhizopus 

and Trlchoderma . 
If the structural gene encodes an oxidase, the host cell 
preferably belongs to the genera Hansenula or Plchia or 
Aspergillus • 

40 Another preferred embodiment relates to a process in 

which the expressible structural gene encodes the light 
or heavy chain of an immuno-globulin or preferably both 
genes, or part of the light or heavy chain of an 
immunoglobulin, preferably that part coding for what 

45 normally is called FAB fragment, or that part thereof 
that codes for the variable regions. Related to this 
embodiment is the use of a gene or genes modified by 
genetic engineering resulting in modified immuno- 
globulins or immunoglobulins with catalytic activity 

50 (Ab2ymes) . 

A preferred expression vector additionally contains a 
deficient gene coding for an enzyme that has been 
disrupted or deleted from the chromosome of the host 
cell, more preferably one encoding an enzyme effective 

55 in the biosynthetic" pathway of .producing an essential 

nutrient, such as an amino acid like leucine, tryptophan 
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or uracil, a nucleotide or a vitamin. 

A process according to the invention can be carried out 
as a normal batch fermentation, a fed-batch fermenta- 
tion, or a continuous fermentation. It is preferred that 
5 the medium contains the essential nutrient in such a 
concentration that at least 20, but preferably at least 
50, copies of the deficient gene are maintained in the 
chromosome, said deficient gene encoding an enzyme 
involved in the biosynthesis of that essential nutrient. 

10 Good yields of the protein to be produced by the 

transformed eukaryote can be obtained when the growth 
rate of the host is between 20 and 100 %, preferably 
between 80 and 100 %, of the maximum growth rate of a 
similar host not deficient for said essential nutrient 

15 under the same fermentation conditions. 



BACKGROUND OF THE LIPASE ASPECT OF THE INVENTION 

20 Lipases and proteases are both known as ingredients of 
detergent and cleaning compositions. Proteases are 
widely used. 

Examples of known lipase-containing detergent compos i- 
25 tions are provided by EPA 0 205 208 and EPA 0 206 390 
(Unilever) which relates to a class of lipases defined 
on basis of their immunological relationship and their 
superior cleaning effects in textile washing. The 
preferred class of lipases contains lipases from a.o. p. 
30 fluoresccns , P. gladioli and Chromobactzer species. 

... 

EPA 0 214 761 (NOVO) and EPA 0 258 068 (NOVO) , each give 
detailed description of lipases from certain microor- 
ganisms, and also certain uses of detergent additives 

35 and detergent compositions for the en2ymes described. 
EPA 0 214 761 gives detailed description of lipases 
derived from organisms of the specimen P. cepacia, and 
certain uses therefor. EPA 0 258 068 gives detailed 
description of 'lipases derived from organisms of the 

40 genus Thermomjrc^es (previous name Bumlcola) and certain 
uses therefor. 

A difficulty with the simultaneous incorporation of both 
lipases and proteases into detergent compositions is 
45 that the protease tends to attack the lipase. 

Measures have been proposed to mitigate this disad- 
vantage . 

50 One such attempt is represented by EPA 0 271 154 

(Unilever) wherein certain selected proteases with 
isoelectric points less than 10 are shown to combine 
advantageously .with lipases. 

55 Another attempt is described in WO 89/04361 (NOVO) , 
which concerns tie tergent compositions containing a 
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lipase from Pseudomonas species and a protease from 
Fusarium or proteases of subtilisin type which has been 
mutated in its amino acid sequence at positions 166, 
169, or 222 in certain ways. It was reported that there 
5 was some reduction in the degree of attack upon the 
lipase by the particular proteases described. 



THE LIPASE ASPECT OF THE INVENTION 

10 

The invention in one of its aspects provides lipases 
produced by recombinant DNA techniques, which carry at 
least one mutation of their amino acid sequences f 
conferring improved stability against attack by 
15 protease. 

For example, the invention provides lipases showing 
immunological cross-reactivity with antisera raised 
against lipase from Chromobacter viscosum var. 

20 llpolytlcum NRRI* B-3673 or against lipase from Pseudo- 
monas £luorcscens I AM 1057 and produced by an artifi- 
cially modified microorganism containing a gene made by 
recombinant DNA techniques which carries at least one 
mutation affecting the amino acid sequence of the lipase 

25 thereby to confer upon the lipase improved stability 
against attack by protease. 

The artificially modified microorganisms include 

Escherichia coll , Pseudomonas aeruginosa , P. put: Id a. and 
30 P. glumae in which the original gene for the lipase has 
been deleted. Bacillus subtilis and various varieties of 
the genus Aspergillus , Rhlzopus and Trichoderma, , 
Sac char omyces cerevlslae and related species, Bans enula 
polymorph* , Plchla and related Species, Kluyveromyces 
35 mar xi anus and related species. As these host cells 

reflect a broad range of different micro-organisms other 
microorganisms not described in detail in the examples 
can be used as well as host cells. 

40 The modified lipase can bring advantage in both activity 
and stability when used as part of a detergent or 
cleaning composition. 

In such lipase, the mutation can for example be selected 
45 from 

(a) introduction (e.g. by insertion or substitution) of 
one or more proline residues at a location 
otherwise vulnerable to proteolytic attack; 

50 

(b) an increase of the net positive charge of the 
lipase molecule (e.g. by insertion of positively- 
charged amino acid residues or by substitution of 
neutral or negatively-charged amino acid residues) ; 

55 
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(c) introduction (e.g. by insertion or substitution) of 
a combination of amino acid residues of the lipase 
capable of becoming glycosylated in the selected 
host cell, thereby improving the stability of the 
5 glycosylated lipase against proteolytic attack. 

Also provided 1by the invention is a method for the 
production of a modified microorganism capable of 
producing an enzyme by recombinant DNA techniques, 
10 characterized in that the gene coding for the enzyme 
that is introduced into the microorganism is fused at 
its 5 '-end to a (modified) pre-sequence. 

In particular embodiments of the invention, the gene of 
15 bacterial origin is introduced with an artificial pre- 
seguence into eukaryotic organisms. 

Accordingly, in certain aspects the invention provides 
artificially modified microorganisms containing a gene 

20 coding for an enzyme and able to produce that enzyme 
derived originally £rom one of the organisms mentioned 
above or a modified form of such enzyme by use of 
recombinant DNA techniques and fermentative processes 
for enzyme production based on such artificially 

25 modified microorganisms. 

The fermentation processes in ' themselves apart from the 
special nature of the microorganisms can be based on 
known fermentation techniques and commonly used 
fermentation arid down stream processing equipment. 

30 

According to a further aspect of the present invention 
it is found that modified (mutant) lipases from 
Pseudomonas or "another of the preferred class of 
lipases, with amino acid sequence modification (s) chosen 
35 to increase the stability of the enzyme to protease 
digestion are of value in detergent and cleaning 
compositions, especially for example in combination with 
proteases, e.g. proteases of the subtilisin type. 

40 A suitable and presently preferred example of such a 

mutation is embodied in a mutant lipase from Pseudomonas 
glumae with a His 154 Pro mutation, which is believed to 
replace a site vulnerable to protease digestion in one 
of the loops of the tertiary structure of the lipase 

45 with a less vulnerable site. 

According to a further aspect of the present invention 
it is found that modified (mutant) lipases from 
Pseudomonas or another of the preferred class of lipases 
50 with amino acid sequence modification (s) chosen to 

increase the net positive charge of the lipase and its 
pi, are of value in detergent and cleaning compositions, 
especially for example in combination with proteases, 
e.g. proteases of the subtilisin type. 

55 



PCI7EP90/01138 



10 



WO 91/00920 



PCT/EP90/01138 



11 



Suitable mutations include for example the deletion of 
negatively charged residues (e.g. aspartate or 
glutamate) or their substitution by neutral residues 
(e.g. serine, glycine and proline) or by the 
5 substitution of neutral or negative residues by 

positively-charged amino acid residues (e.g. arginine or 
lysine) or the insertion of positively-charged residues. 

Suitable examples of such mutations increasing the net 
10 positive charge and pi include D157R, D55A and I110K. 

Suitable examples of the introduction (e.g. by insertion 
or substitution) of a combination of amino acid residues 
capable of becoming glycosylated in the selected host 
and thereby improving its stability against proteolytic 
15 attack are given by mutations D157T and insertion of G 
between N155 and T156. 

To avoid over-glycosylation or to remove glycosylation 
on less desirable positions the potential glycosylation 
sites of the original lipase can be removed. 

20 

Within the preferred class of lipases the lipase 
produced by Pseud onion as glumae (formerly and more 
usually called Pseudomonas gladioli) is a preferred 
basis for the processes and products of this invention. 
25 Neither the amino acid sequence nor the nucleotide 

sequence of the gene coding for the preferred lipase was 
previously known. The present inventors have isolated 
the gene coding for the preferred lipase of this 
bacterium as will be illustrated below. 

30 

The invention also provides genetic derived material 
from the introduction of this gene into cloning vectors, 
and the use of these to transform new host cells and to 
express the lipase gene in these new host cells. 

35 

Usable heterologous new host cells include for example 
Escherichia calx , Pseudomonas aeruginosa, P. puclda. 
Also P. glumae in which the original lipase gene has 
been deleted is a suitable host. The preferred host 

4 0 systems for large scale production are Bacillus 

subtilis , Saccharomyces cerevlsxae and related species, 
Xluyveromyces marxianus and related species, Bansenula 
polymorpha, Pichia and related species and members of 
the genera Aspergillus , Rhizopus and Trichoderma . Also 

45 suitable hosts for large scale production are Gram (-) 
bacteria specially selected and/or modified for 
efficient secretion of (mutant) lipases. 
As these host cells reflect a broad range of different 
microorganisms other microorganisms not described in 

50 detail ii\ the examples can be used as well as host 
cells. 

Another embodiment of the invention relates to vectors 
able to direct the expression of the nucleotide sequence 
55 encoding a gene coding for an enzyme as described above 
in one of the preferred hosts preferably comprise: 
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ds DNA . coding for mature enzyme or pre— enzyme 
directly * down stream of a (for the selected host 
preferred) secretion signal; in cases where the 
part of the gene that should be translated does not 
start with^the codon ATG, an ATG should be placed 
in front. The translated part of the gene should 
always end with an appropriate stop codon; 
an expression regulon (suitable for the selected 
host organism) situated upstream of the plus 
strand of the ds DNA of (a) ; 

a terminator sequence (suitable for the selected 
host organism) situated down stream of the plus 
strand of the ds DNA of (b) ; 

nucleotide sequences which facilitates integration, 
preferably multicopy integration, of the ds DNA of 
(a~c) into the genome of the selected host which 
host is deficient for an essential nutrient. The 
nucleotide sequence that facilitates multicopy 
integration is ds ribosomal DNA or at least part of 
this sequence. Moreover a ds DNA sequence 
containing the deficient gene coding for the 
enzyme that is absent in the host cell has to 
be present on the integration vector, and 
optionally a ds DNA sequence encoding proteins 
involved in temporary inactivation or unfolding 
and/or in the maturation and/or secretion of one of 
the precursor forms of the enzyme in the host 
selected* 

30 The invention will be illustrated by the following 
examples* 

Example 1. isolation and characterization of the 

gene encoding (pre) -lipase of P. glumae. 

35 

Example 2. Construction of the lipase negative P. 

glumae strains PG2 and PG3 . 

Example 3. Construction of a synthetic gene encoding 

40 P. glonae (pre) "lipase. 

Example 4. Introduction of the (wild type) synthetic 

lipase gene in the lipase negative p. gluoae PG3. 

45 Example 5, Production of mutant lipase genes and 

their introduction in PG3 ♦ 

Example 6. Expression of the synthetic lipase genes 

in Saccharomyces cerevisiae using, autonomously replicat- 
50 ing smids. 

Example 7. Expression of synthetic lipase genes in 

Saccharomyces cerevisiae using multicopy integration. 



5 

(b) 

10 

(c) 

(d) 

15 
20 

(e) 

25 



55 



Example 8. Production of guar «-galactosidase in 

Saccharomyces cerevlslae using, multicopy integration. 
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Example 9. Multicopy integration in Saccharomyces 

cerevlsiae using other deficient selection markers - 

Example 10. Stability of the multicopy integrant in 
5 continuous cultures. 

Example 11. Parameters affecting the stability of 
multicopy integrant SU50B 

10 Example 12* Expression of the synthetic lipase genes 
in Hansenula polymorph*. 

Example 13 . Production of guar a-galactosidase in 
Hansenula polymorphs using multicopy integration. 

Example 14, Multicopy integration in nuyveromjces. 

Examples 1-6 and 12 relate to the isolation, cloning and 
expression of lipase genes in the yeasts Saccharomyces 
20 cerevls lae and Kansenula polymorphs, using plasmid 
vectors . 

Example 7 relates to the expression of a lipase gene in 
the yeast Saccharomyces cerevislae after multicopy 

integration of an expression vector according to the 
25 invention. 

Examples 9-11 relate to other aspects of the multicopy 
integration. 

Examples 8 and 13 relate to the expression of guar «- 
galactosidase in the yeasts Saccharomyces cerevzslae and 
30 Hans enula polymorphs. . 

Example 14 shows that multicopy integration can be 
achieved in a Kluyveromyces yeast. 



35 EXAMPLE 1. ISOLATION AND CHARACTERIZATION OF THE 
GENE ENCODING ( PRE) -LIPASE OF P. GLUMAE , 

Isolation of P. glumae chromosomal DNA. 

40 Cells of a 15 ml overnight culture in LB medium were 
collected by centrifugation (Sorvall HB4 rotor, 10,000 
rpm for 10 min) . The cell pellet was stored at - 20 °c 
overnight. 

After thawing the cells were re-suspended in 10 ml SSC 
45 (0.15 M NaCI, 0.015 M Na-citrate) containing 2 mg/ml 
lysozyme. After incubation for 30 min at 37 °c, 0.5 ml 
of 10% SDS was added, followed by an incubation at 70 °C 
for 10 min. After cooling to 45 °c r 1 ml proteinase K (2 
mg/ml Tris-HCl pH 7.0, pre-incubated for 30 min at 45 
50 °C) was added and the mixture was incubated at 4 5 °c for 
another 30 min. Next, 3.2 ml 5 M NaCl0 4 was added, 
followed by two extractions with 15 ml CHCl 3 /iso-c 5 H 1 lOH 
(24:1), each of which was followed by a centrifugation 
step (Sorvall HS4 , 5,000 rpm/10 min). The DNA was 
55 precipitated from the supernatant by adding 10 ml 

ethanol. After a wash in 75% ethanbl the, DNA pellet was 
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re-suspended in 2 ml H2O. 

Preparation of a gene bank. 

5 A DNA preparation of P. glumae was partially digested 
with the restriction enzyme Sau 3A, as described by 
Maniatis (7) . The cosmid vector c2RB (8) was digested 
to completion with Sm&X and Bam HI, both enzymes having 
one recognition site in the cosmid. Excess vector 

10 fragments were ligated, using T4 DNA ligase (in 50 mM 
Tris-HCl pH 7.5, 10 mM dithiotreitol (DTT) , 10 mM MgC12 
and 0.5 mM rATP) , with DNA fragments from P . glumae. The 
recombinant DNA thus obtained was packaged in phage par- 
ticles as described by Hohn (9) . The complete phage 

15 particles obtained this way were used to transform £. 
call 1046 oet ( gal, lac, hsd*R, phx , supTL, hsdM, recA) 
by transection. 

5ml fresh LB medium containing 0.4 % maltose, was 
inoculated with 0.5 ml of a overnight culture of E* coll 

20 1046 and incubated for 6 h. at 37 °c under continuous 
shaking. Before infection with phage particles, MgCl 2 
and CaCl? were added to a final concentration of 10 mM. 
In a typical experiment 50 fil phage particles were mixed 
with 50 jil of cells and the mixture was incubated at 37 

25 °C for 15 mint 100 ^1 13 medium was added and incubation 
at 37 °C continued for 30 min. The cells were plated 
directly an LBi-agar plates containing 75 jig/ml ampicil- 
lin (Brocacef ) . After overnight growth at 37 °c ca. 300 
colonies were obtained. 

30 

Oligonucleotide synthesis. 

As probes for the lipase encoding DNA fragment, we used 
oligonucleotides based on the sequences of the 24 N- 
35 terminal amino acids (see below), determined by Edman 
degradation, using an Applied Biosystems Gas Phase 
Protein Sequencer. 

Based on the established amino acid sequence, all the 
possible nucleotide sequences encoding the amino acid 

4 0 sequence were derived. Deoxy-oligonucleotides containing 
all or part of the possible nucleotide sequences (so 
called mixed-probes) were synthesized on a DNA syn- 
thesizer (Applied Biosystems 380 A) using the Phospho- 
amidit technique (10) . Oligonucleotides were purified on 

45 16% or 20% polyacrylamide gels (7). 

Rad^Q-label ed oligonucleotide probes. 

Typically, 0.1-0.3 fig of the purified oligonucleotide 
50 was labelled by incubation for 30 minutes at 37 °c in 50 
mM Tris-HCl pH 7.5, 10 mM MgCl 2 , 0.1 mM EDTA, 10 mM DTT, 
70 ,iCi gamma-^P-ATP (3000 Ci/mmol, Amersham) and 10 
units T4' polynucleotide kinase (Amersham) in a final 
volume of 15 The reaction was terminated with 10 pi 

55 0.5 M EDTA pR 8.0 and passed through a Sephadex G2 5 

column of 2.5 ml (disposable syringe) equilibrated with 
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TE buffer (10 mM Tris-HCl pH 8.0 and l mM EDTA) * 
Fractions of 250 /il were collected, from which the 
first two radioactive fractions, usually fractions 4 and 
5, were pooled and used for hybridization. 

5 

Screening; of the gene bank. 

From several packaging and transfection experiments, 
performed as described above, a total of ca 1000 

10 separate colonies were obtained. These colonies were 

transferred to EIiISA plates (Greiner, F-form) containing 
150 pi LB-medium (loo ampicill in/ml ) /wel 1 . After 
overnight growth at 37 °C duplicates were made using a 
home-made template, consisting of 68 pins, arranged to 

15 fit in the microtiter wells. To the wells of the 

masterplates 50 jxl 50 % glycerol was added, and after 
careful mixing with the aid of the template, these 
plates were stored at -80 °c. 

The duplicates were used to transfer the gene bank to 
20 nitro-cellulose filters (Millipore, type HATF, 0.45 /im, 
4 14 cm) . To this end the cellulose filters were pre- 
wetted by laying them on LB- agar plates with 100 ng/m\ 
ampicillin. After transfer of the bacteria with the aid 
of the template, colonies were grown overnight at 37 °c. 

25 

The colonies on the filters were lysed by placing them 
on a stack of Whattman 3 MM paper, saturated with 0-5 M 
NaOH, 1.5 M NaCl for 15 min. After removal of excess 
liquid by placing the filters on dry paper, they were 

3 0 neutralised by placing them on a stack of 3 MM paper, 

saturated with 1 M Tris-HCl pH 7.0, 1.5 mM NaCl for 2-3 
min. Finally the filters were dunked into 10* x SSC (1.5 
M NaCl, 0.15 M Na-citrate) for 30 sec, air dried and 
baked at 80 °C under vacuum for 2 hours. Prior- to 
35 (pre) hybridization the filters are washed extensively in 
3 x SSC, 0.1% SDS at €5 °C for 16-24 h with several 
changes of buffer. The washing was stopped when the 
colonies were no longer visible. 

40 Pre-hybridization of the filters was performed in 5 x 

SSC, 5 x Denhardts (10 x Denhardts = 0.2 % ficoll, 0.2 % 
polyvinyl-pyrxolidone, 0.2 % bovine serum albumin), 0.1 
% SDS, 50 mM sodium phosphate pH 7.5, 1 % glycine, 100 
Mg/ml calf -thymus DNA (sheared and heat denatured) , 500 

4 5 pg/ml tRNA and 50 % de-ionized formamide for 2 hours at 

37 °C. 

Hybridization with a radio-active labelled (see above) 
mixed probe (vis02, 32 nucleotides) was performed in 5 x 
SSC, 1 x Denhardts, 0.1 % SDS, 20 mM sodium phosphate pH 

50 7.5, 100 f*g/ml calf -thymus DNA, 500 ng/ml tRNA and 50 % 
deionized formamide, for 16 h. at 39 °C. After the 
hybridization, the filters are washed: 3 x 15 min with 6 
x SSC at room temperature, l x 15 min 2 x SSC, 0.1% SDS 
and subsequently at a room temperature dependent on the 

55 properties of the JDligonucleotide probe. For vis02 

washing was extended for 15 min at 37 °C in preheated 
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0.1 SSC 0.1% SDS. 

Upon screening the gene bank as described above, several 
cosmid clones were isolated. Clone 5G3 (hereinafter 
called pDR6000) was chosen for further investigations. 

5 

Sequencing of the lipase gene. 

DMA fragments resulting from digestion of pUR6000 with 
fismHX were ligated in plasmid pEMBL9 (11) which was 
10 also cleaved with BaaHI and the obtained recombinant 
DNA was used to transform E. coll JM101 (12), with the 
CaCl 2 procedure and plated on LB-agar plates 
supplemented with X-gal and IPTG (7) . 

68 white colonies were transferred to microtiter plates 

15 and subjected to the same screening procedure as 

described for the cosmid bank. Several positive clones 
could be isolated. A representative plasmid isolated of 
one of these colonies is depicted in Fig. 1 and is 
referred to as pUR6002. Upon digesting this plasmid with 

20 £coRI, two fragments were found -on gel, -4.1 kb and 
-2.1 kb in length, respectively. Another plasmid, 
pUR6001, contained the BamHX fragment in the opposite 
orientation. After digestion with fcoRI, this plasmid 
resulted in fragments with a length of -6.1 kb and -70 

25 bp, respectively. 

In essentially the same way pUK6006 was constructed . In 
this case pUR6000 was digested with EcoBJ. after which 
the fragments were ligated in the EcoBX site of plasmid 
pLAFRl (13). After screening the trans formants, a 

30 positive clone was selected, containing a EcoRl 
fragment of -6 kb, designated pUR6006 (Fig. 1) . 
The purified DNA of pUR6001 and ptIR6002 was used for the 
establishment of the nucleotide sequence by the Sanger 
dideoxy chain termination procedure (14) with the 

35 modifications as described by Biggin et al. (15) , using 
alpha- 35 S-dATP (2000Ci/mmol) and Klenow enzyme (Amer- 
sham) , ddNTP's (Fnarmacia-PL Biochemicals) and dNTP's 
(Boehringer) . We also used the Seguenase kit (United 
States Biochemical Corporation) , with substitution of 

40 the dGTP for 7-deaza-dGTP. The sequencing reaction 

products were separated on a denaturing poly aery 1 amide 
gel with a buffer gradient as described by Biggin et al. 
(15). 

45 The complete nucleotide sequence (1074bp) of the P. 
glutaae lipase (hereafter also called: glumae lipase) 
gene is given in Fig. 2. 

The nucleotide sequence contains an open reading frame 
encoding 358 amino acid residues followed by a stop 
50 codon. 

The deduced amino acid sequence is shown in the IUPAC 
one-letter notation below the nucleotide sequence in 
Fig. 2 . 

55 The NH2 -terminal amino acid sequence of the lipase 

enzyme as purified from the .P. gl.vmae culture broth has 
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been identified as AlaAspThrTyrAlaAlaThrArgTyrProVal- 
IleLeuValHisGlyLeuAlaGlyThrAspLys (= ADTYAATRYPV- 
ILVHGIAGTDK) . This amino acid sequence is encoded by 
nucleotides 118-183 (Fig. 2) . Firstly, from these 
5 findings it can be concluded that the mature lipase 

enzyme is composed of 319 amino acid residues, and has a 
calculated molecular weight of 33 f 092 dalton. 
Secondly, the enzyme is synthesized as a precursor, with 
a 39 amino acid residue N-terminal extension (numbered - 

10 39 to -1 in Fig. 2). 

From the scientific literature it is well known that 
most excreted proteins are produced intracellular as 
precursor enzymes (16) - Most commonly these enzymes have 
a N-terminal elongation, the so-called leader peptide or 

15 signal sequence. This peptide is involved in the initial 
interaction with the bacterial membrane. 
General features of the signal sequence as it is found 
in gram negative bacteria are: 

20 1. an amino-terminal region containing (on average) 2 
positively charged amino acid residues; 

2. a hydrophobic sequence of 12 to 15 residues ; 

3. a cleavage site region, ending with serine, alanine 
or glycine 

25 4. the total length is approximately 23 amino acids. 
Surprisingly, the lipase signal sequence comprises 39 
amino acids, which is rather long. Furthermore, it 
contains four positively charged amino acids at the N- 
terminus . 

30 For gram negative bacteria, this seems to be an 
exceptional type of signal sequence* 

Isolation of genes from other organisms, encoding 
related lipases. 

35 

As mentioned earlier, the p. glumae lipase belongs to a 
group of immunologically related lipases. From this it 
can be expected that these enzymes, although produced by 
different organisms, contain stretches of highly 
40 conserved amino acids sequences. 

As a consequence there has to be certain degree of 
homology in the DNA-sequence . 

Having the P. glumae lipase gene at our disposal, it is 
easy to isolate related lipase genes from other 
45 organisms. 

This can be done in essentially the same way as 
described above. 

From the organism of interest a gene bank (for example 
in a cosmid or phage Lambda) is made. This genome bank 
50 can be screened using (parts of) the -2.2 kb BamHI 

fragment (described above) as a probe. Colonies giving a 
positive signal, can be isolated and characterized in 
more detail. 
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EXAMPI£ 2. CONSTRUCTION OF THE LIPASE NEGATIVE P. 
GUUHAE STRAINS PG2 AND PG3. 

The construction of PG2 f from which the lipase gene has 
been deleted; and PG3, in which the lipase gene has been 
replaced with a tetracycline resistance (Tc-res) gene, 
comprises three main steps. 

A - construction of pUR6106 and pUR6107 (in E. coll), 
starting from PTJR6001 (see example 1) : 

pUR6001 contains a B&mHZ fragment from the P . glumae 
chromosome of ~2»2kb. The lipase gene (1074 base pairs) 
situated on this fragment, has a 5'- and a 3'- flanking 
sequence of -480 and -660 base pairs, respectively. 
Subsequent construction steps were: 

a. partial digestion of pUR6001 (isolated from E, coll 
KA816 , dcm—6, thr , leu, thi , LacY, galK2 , 

ara-14 , tonA31, supE44) (also named 

GM418 [17]) with Clal, to obtain linearized 
plasmids 

b. phenol extraction and ethanol precipitation (7) of 
the DNA, followed by digested with PscZ 

c. isolation of a 4,5 kb plasmid DNA fragment (having 
Clal and a P*tl sticky ends) , and a PstX fragment 
of -670 bp from agarose gel after gel electropho- 
resis followed by electro-elution in dialysis bags 
(7) 

d. the obtained plasmid DNA fragment with a Clal and 
a PstJL sticky end was ligated with a synthetic 
linker fragment (shown below) , with a Clal and a 
Pstl sticky end. 

Clal CGATC3U3VTCITCAro PstX 
TACTCTAGAACTAGIG 
This synthetic fragment contains a recognition site 
for the restriction enzymes Bell and Bglll. 
After transformation of the ligation mixture to E. 
coll SA101 (is JM101 with recA, hdsR) , selection 
on LB-Ap (100 /*g ampicillin/ml) agar plates, and 
screening of the plasmids from the obtained 
trans formants by restriction enzyme analysis, a 
correct plasmid was selected for the next construc- 
tion step* Upon digesting this correct plasmid with 
BamHI and jriadlll a vector fragment of -4kb and an 
insert fragment of -500 bp were found. 

e. the plasmid construct obtained as described in d. 
was digested with Fstl, and ligated together with 
the -670 bp PstI fragment isolated as described in 
c. 

f . transformation of the ligation mixture to E. coll 
SA101, selection on LB-Ap (100 m9 ampicillin/ml) 
agar plates, and screening of the plasmids from the 
obtained transf ormants . Since the PstI fragment 
can have two different orientations this had to be 
analysed, by. means of restriction enzyme analysis. 
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In the construct we were looking for, the orienta- 
tion should be thus that digestion with JBamHI 
results in a vector fragment of -4 kb and an 
insert- fragment of -1.2 kb . 
5 A representative of the correct plasmids is 

depicted in Fig- 3 and was called pUR6102. 

g. pUR6102 was digested to completion with BglXl 

h. pBR322 (18) was digested to completion with Aval 
and KcoRI, after which the DNA fragments were 

10 separated by agarose gel electrophoresis* A 

fragment of -14 35 base— pairs, containing the 
tetracycline resistance gene was isolated from the 
gel by electro-elution 

i. upon filling in the sticky ends (in a buffer 

15 containing 7 mM tris-HCl pH7.5, 0.1 mM EDTA, 5 mM 

/3-mercapto-ethanol, 7 mM MgCl 2 , 0.05 mM dNTPs and 
0.1 <f>/pl Klenow polymerase) of the DNA fragment 
containing the Tc-res gene and the linearized 
ptJR6102 they were ligated. 

20 j. transformation of E. coll SA101 with ^the ligation 
mixture, selection on LB-Tc (25/ig tetracyc line/ml) 
agar plates, and screening of the plasmids from the 
obtained trans formants by restriction enzyme 
analysis. 

25 The construction route of pUR6l02 and pUR6i03 is 

depicted in Fig. 3. 
k. pUR6102 was digested with BamHl and pUR6103 was 

partially digested with BamHl; the obtained 

fragments were separated by agarose gel electropho- 
3 0 resis and the desired fragments (-1145 bp and -2550 

bp resp.) were isolated out of the gel by electro- 

elution . 

1. pR2l02 (19) was digested to completion with BamHI 
and ligated to the BamHI fragments obtained in 
35 step k. 

m. transformation of the ligation mixtures to E.- coli 
S17-1 (20) , selection on LB-km,Tc (25 /*/ml each) 
and screening of the plasmids from the obtained 
transformants, by restriction enzyme analysis. The 
40 resulting plasmids were called pUR6106 and pUR6107 

(Fig. 4), respectively. 



45 



50 



B - Deletion of the lipase gene of the P. glumae 
chromosome . 

a. Introduction of pUR6106 in P. glumae via biparental 
conjugation with E, coli S17-1 (pUR6106) (which is 
the notation for £. coli S17-1 containing plasmid 
PUR6106) . 



A P. glumae colony was transferred from a MME plate 
(0.2 g/1 MgS0 4 -7H 2 0, 2 g/1 Citrate-H 2 0, 10 g/1 
K 2 HP0 4 , 3.5 g/1 NaNH 4 HP0 4 . 4H 2 0 , 0.5% glucose and 
1.5% agar) to 20 ml Luria Broth (LB) culture medium 
55 and grown overnight at 30 °C. £\ coli S17- 

l(pUR6106) was grown overnight in 3 ml LB medium. 
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25 ftg/ml Km, at 37 °C. 

The "next day the P. glumse culture was diluted 1:1 
and grown for 4 to 5 hours at 30 °C until OD660 is 
2.0 * 2.5- E. coli S17-1 (pUR6106) was diluted 1:50 
5 and grown for 4 to 5 hours at 37 °C until OD660 is 

1.5-2.0 

For the conjugation 50 od units (1 unit - 1 ml with 
OD = 1) (20 to 25 ml) P. glumae cells and 2.5 OD 
units (1^2 - 1.6 ml) E. coll S17-1 (pUR6i06) were 

10 mixed and spun down for 10 min at 5,000 rpm (HS4- 

rotor) . The cell pellet was divided over 3 LB 
plates and incubated overnight at 3 0 °C. 
Subsequently the cell material was removed from the 
plate and re-suspended in 3 ml 0.9% KaCl solution 

15 and pelleted by centrifugation (10 min, RT, HB4- 

rotor, 4krpm) . 

The cell pellet was re-suspended in 1.8 ml 0.9% 
Nad solution and divided over 3 plates MME, 0.5% 
glucose, 1.5% agar, 50 /ig/ml kanamycin (Km) and 

20 grown at 30 °C. 

Since pUR6l06 does not replicate in P. glumae, Km 
resistant trans-con jugants can only be obtained by 
integration. In these strains the plasmid pUR6106 
is integrated into the bacterial chromosome by a 

25 single, recombination event at the 5'- or 3'- 

flanging region. Due to the fact that these strains 
still contain a functional lipase gene, their 
phenotype is lipase positive • 

30 b. Two such strains (PG-RZ21 and PG-RZ25) were 
selected for further experiments. 
To delete the plasmid and the functional lipase 
gene out of the chromosomal DMA, a second recom- 
bination event should take place. This can be 

35 achieved by growing said strains for several days 

on IiB-medium without Km (without selective 
pressure) , plate the cells on BYPO-plates (10 g/1 
trypticase peptone, 3 g/1 yeast extract, 5 g/1 beef 
extract, 5 g/1 NaCl, 7 g/1 KH 2 P0 4 , 50 ml/1 olive 

4 0 oil emulsion and 1.5% agar) in a density which 

assures separate colonies, and screen for lipase 
negative colonies. Upon plating these lipase 
negative colonies on selective plates (MME -KM 50 
j*g/ml) , they should not grow. A strain obtained in 

45 this way could be called PG-2. 

C - Replacement of the' lipase gene of the P. glumae 
chromosome by the Tc-res gene. 

50 a. Introduction of pUR6107 in P. glumae via 

conjugation with £. coli S17-1 (pUR6107) as 
described in B. Selection of trans-con jugants 
was performed at 3 0 °C on MME -medium contain- 
ing 50 /«g/ml Tc. 



55 



Trans-con jugants obtained in this way were dupli- 
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cated to BYPO-plates containing 50 fig/ml Tc and to 
MME-plates containing 10 o /*g/ml Km. Several trans- 
con jugants exhibited a Km sensitivity (no growth on 
MME Km-100 plates) and lipase negative (no clearing 
5 zone on BYPO-plates) phenotype. Due to a double 

cross over (at the 5'- and at the 3 '-flanking 
region) the lipase gene was replaced by the Tc 
resistance gene. 

One representative strain was selected for further 
10 investigation and was called PG-3 . 



EXAMPLE 3. CONSTRUCTION OF A SYNTHETIC GENE ENCODING 
P. GLUMAE ( PRE) -LIPASE, 

15 

Based on the nucleotide sequence of the p. glumae (pre) - 
lipase gene a new gene was designed , containing several 
silent mutations. Due to these mutations the amino acid 
sequence of the enzyme was not changed. It was however 

20 possible to lower the GC-content, which facilitates 

enzyme engineering and enabled us to use the synthetic 
gene in a variety of heterologous host systems. 
An other point, facilitating enzyme engineering, was the 
possibility to introduce restriction enzyme recognition 

25 sites at convenient positions in the gene. 

The sequence of the new gene is given in Fig- 5(A) . 

The new gene was divided in restriction fragments of 
30 approximately 200 nucleotides, so-called cassettes. An 
example of such a cassette is depicted in Fig. 6. 

Each cassette was elongated at the 5' and 3' end to 
create an EcoKl and ITindlll site respectively. 

35 

The coding strands of these cassettes were divided in 
oligo-nucleotides (oligos) with an average length of 33 
bases. The same was done for the non coding strands, in 
such a way that the oligos overlapped for - 50% with 
40 these of the coding strand. 

The oligos were synthesized as described in example 1. 

Before assembling the fragments, the 5' ends of the 
45 synthetic oligos had to be phosphorylated in order to 
facilitate ligation. Phosphorylation was performed as 
follows: 

Equimolar amounts (50 pmol) of the oligos were pooled 
and kinated in 4 0 ^1 reaction buffer with 8 Units 
50 polynucleotide kinase for 30-45 minutes at 37 °c. The 
reaction was stopped by heating for 5 minutes at 70 °c 
and ethanol precipitation. 

Annealing was done by dissolving the pellet in 30 fil of 
55 a buffer containing: 7 mmol/1 Tris-HCl pH 7.5, 10 mmol/1 
2-mercapto-ethanol, 5 mmol/1 ATP were added. 
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Subsequently the mixture was placed in a water bath at 
65 °C for 5 minutes, followed by cooling to 30 °C over a 
period of 1 hour- MgCl 2 was added to a final concentra- 
tion of 10 mmol/1* T4 DNA-Ligase (2.5 Units) was added 
5 and the mixture was placed at 37 °C for 30 minutes or 

o/n at 16 °C. After this the reaction mixture was heated 
for 10 minutes at 70 °C- 

After ethanol precipitation the pellet was dissolved in 
digestion buffer and cut with Eco'Rl and J?indIII. 

10 

The mixture was separated on a 2% agarose gel and the 
fragment with a length corresponding to the correctly 
assembled cassette was Isolated by electro-elution. 

15 The fragments were ligated in pEMBL9 (digested with 

EcoRI/JTlndlll) as described in example 1, and they were 
checked for correctness by sequence analysis. In 
subsequent cloning steps the various cassettes were put 
together in the proper order, which resulted in pUR6038. 

20 This is a pEMBL9 derivative containing the complete 
synthetic lipase gene. 

To be able to njake the constructions as described in 
example 4 r a second version of the synthetic gene was 
25 made, by replacing fragment 5. In this way construct 
pTJR6600 was made, having the 3' PstI site at position 
1069 instead of position 1091 (See Fig, 5B) . 



3 0 EXAMPLE 4. INTRODUCTION OF THE (WILD TYPE) SYNTHETIC 
LIPASE GENE IN THE LIPASE NEGATIVE P. GLUMAE PG3 . 

In order to test whether the synthetic lipase gene is 
functional in P. glumae, the gene was introduced in 
35 strain PG3 . 

To simplify fermentation procedures, it was decided to 
stably integrate this gene in the PG3 chromosome, rather 
than introducing on a plasmid. 

For this reason the synthetic lipase gene had to be 
40 equipped with the 5' and 3' border sequences of the 
original p. gluaae lipase gene. 

This was achieved in the following way (see Fig. 7) : 

a. From' pUR6002 (ex JET. coll KA816) a vector with ClaT 
and PstJ. sticJcy ends was prepared in the same way 

45 as described in example 2. 

b, pUR6§00 (ex £. coll KA816) was digested to 
completion with Cla.1 and partial with Pstl. After 
separating the fragments by agarose gel electropho- 
resis a fragment of -1050 bp was isolated. 

50 c. The fragment thus obtained, was ligated in the 
pUR6002 derived vector and used to transform E. 
coll SA101'; in this way construct pUR6603 was 
obtained. 

d. pUR6603 was digested to completion with BamHI . 
55 After separating the fragments by agarose gel 

electrophoresis a fragment of ~2-2kb was isolated. 
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This fragment contains the synthetic lipase gene 

with the 5' and 3' flanking regions, of the wild 

type P. gladioli lipase gene, 
e. pRZ102 was also digested to completion with BamHI. 
5 f. The 2.2 kb fragment obtained in d. was ligated in 

pR2102 as described in example 2. 
g« The resulting construct , pUR6131 was transferred to 

£. coii S17-1. 
Integration of this construct in the chromosome of PG3 
10 was accomplished in the same way as described for 
pUR6106 in example 2 section B-a- 

From the obtained Km-resistant trans-con jugants , several 
were transferred to BYPO plates. They all appeared to 
15 have the lipase positive phenotype, since clearing zones 
occurred around the colonies * A typical representative 
was called PGL26. 

Obviously the same route can be followed to integrate 
construct (pUR6131) in a lipase negative P. glumae PG2 
20 (see example 2B-b) strain . 

From the examples 2 and 4 it might be clear that the P . 
glumae strain PG1 (and derivatives thereof, e.g. PG2 
and PG3; or derivatives of PG1 obtained via classical 
25 mutagenesis having an improved lipase production) can be 
manipulated easily by deleting or introducing (homolo- 
gous or heterologous) DNA fragments in the bacterial 
chromosome • 

By using these techniques it is possible to construct a 
30 strain optimized for the production of lipase. In this 
respect one could think of: 

- replacing the original lipase promoter, by a 
stronger ( inducible ) promotor , 

- introduction of more than one copy of the lipase 
35 gene (eventually, encoding different lipase 

mutants) , 

replacing the original promoter, or introduction of 
more copies of genes encoding functions involved in 
the production and excretion of the lipase enzyme 
40 (eg. chaperon proteins, "helper proteins" involved 

in the export of the lipase enzyme) , 

- deletion of the gene encoding extracellular 
protease (A Tn5 mutant of PG1 (PGT89) which does 
not produce a clearing zone or skimmilk plates has 

45 been deposited) , 

- manipulating the rhamnolipid production. 



EXAMPLE 5. PRODUCTION OF MUTANT LIPASE GENES AND 
50 TKEIR INTRODUCTION IN PG3. 

To improve the lipase, it is necessary to have the 
possibility to introduce well-defined changes in the 
amino acid sequence of the protein . 
55 A preferred method to achieve this is via the replace- 
ment of a gene fragment of the synthetic gene encoding 
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wild type lipase or of the wild type P. glumae lipase 
gene, with a corresponding chemically synthesized 
fragment containing the desired mutation. 
In the case of the synthetic wild type lipase gene, a 
5 cassette (or fragment thereof) can be replaced with a 
corresponding cassette (or fragment thereof) containing 
the desired mutation. 

The cassette, comprising the codon(s) for the amino 
10 acid(s) of interest, was assembled once more (as 

described in Example 3) . This time however, the oligos 
of the coding and the non-coding DNA strands, comprising 
the codon(s) of interest, were replaced by oligomers 
with the desired mutation* The new oligos were syn- 
15 thesised as described in Example 1. 

The thus obtained mutant cassette, or a fragment thereof 
was introduced at the corresponding position in the 
synthetic wild type lipase gene of constructs like 
ptTR6038 or pTJR6603. 
20 To introduce a synthetic mutant lipase gene in PG2 or 
PG3, the route as described in Example 4 has to be 
followed, starting at step d. 

A typical example of the production of a mutant gene is 
25 described below. In this case the His at position 154 of 

the wild type lipase gene has been replaced by a Pro. 

To accomplish this, two new oligomers were synthesized. 

The codon encoding amino acid 154 of the mature lipase 

is changed to CCT. 
3 0 These oligomers were used to assemble fragment 3 (H154P) , 

as described in example 3- After cloning the fragment in 

pEMBL9, the DNA sequences was determined as described in 

example 1. The thus obtained construct was called 

TO 6 071. 

35 Plasmid pUR6071 was digested to completion with FspX 

and Sail. Upon separation of the obtained DNA fragments 
via gel electrophoresis {as described in example 2), a 
fragment of -90 bp was isolated out of agarose gel. 
pUR6002 was partially digested with Fspl and partially 

40 with Sail. After gel electrophoresis a vector of -6000 
nucleotides was isolated out of the agarose gel in 
example 2. 

The isolated -90 bp fragment was ligated in the pUR6002 
vector to obtain pUR6077A. 
45 The BonxKl fragment (-2200 bp) of pUR6077A was ligated 
in pRZ102 as described in examples 3 and 4, In this way 
pUH6127 was obtained. 

Introduction of this construct into the chromosome of 
PG3 was accomplished as described in example 4 . A 
50 resulting lipase producing p. glumae trans-con jugant , 
was called PGL24 . 

The modified lipase produced by this strain proved to be 
significantly more stable than the parent lipase in an 
actual detergents system (Fig. 8) . 
55 In essentially the same way several other mutant lipase 
genes have been made. In some cases this resulted in a 
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altered net charge of the encoded protein (eg. D157R 
(+2) , D55A (+1), I110K (+1), R61P (-1), T109D (-1), R8D 
(-2))- In other cases amino acids have been introduced 
or deleted (eg. PGL40 in which 152S-154H has been 
5 replaced by AlaLeuSerGlyHisPro = ALSGHP) . 

Furthermore potential glycosylation sites have been 
removed (eg. N4BS and/or N238S) and/or introduced (eg. 
D157T and insertion of G between N155 and T156) . 

10 

EXAMPLE 6, EXPRESSION OF THE SYNTHETIC LIPASE GENES IN 
SAC CBAROMYCES CEREVISIAE USING AUTONOMOUSLY REPLICATING 
PLASMXDS . 

15 To illustrate the production of P . glumae lipase by 

eukaryotic micro-organisms, vectors suited for expres- 
sion of P. glumae lipase in the yeast cerevisiae 
using the GAL7 promoter (21) were constructed. The p. 
glumae lipase is produced by the yeast 5. cerevisiae 

20 using two different expression systems. An expression 
system based on autonomously replicating plasmids with 
the lipase expression cassette and an expression system 
based on multicopy integration of the lipase expression 
cassette in the host genome. 

25 

The plasmid pUR2730 (21) was used as the basis for the 
lipase expression plasmids. The plasmid pUR2730 consists 
of the GAL7 promoter, S. cerevisiae invertase signal 
sequence, a-galactosidase gene (the a-galactosidase 
30 expression cassette) , 2/trn sequences for replication in 
S. cerevisiae , the LEU2d gene for selection in 5. cerev- 
isiae and pBR322 sequences for replication and selection 
in E m coll. 

The plasmid pUR6038 was used as the source for the 
35 lipase gene. 

The following s. cerevisiae expression plasmids were 
constructed , encoding : 

40 l. mature lipase preceded by the invertase signal 
sequence (pUR6801) , 
2. mature lipase preceded by a KEX2 cleavage site, a 
glycosylation site and the invertase signal 
sequence (pUR6802) . 

45 

In order to obtain the above mentioned constructs, the 
routes followed were (Fig. 9; the used restriction 
recognition sites are marked with an asterisk) : 

50 ad 1 and 2 . 

a. The plasmid pUR2730 was digested with Sad and 
Hindlll and the vector fragment was isolated. 

55 b. The plasmid pUR603 8 was digested with £coRV and 

Hln dill and the fragment with the lipase gene was 
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isolated. 

c. Synthetic Sacl-EcoRV DNA fragments were synthesized 
and? constructed as described in example 3, 
consisting of the following sequences: 

5 

In the case of pUR6801: 
I. 

5 ' CATCACACAAACAAACAAAACAAAATGATGCTITTGCAAGCCTTCCTTTTCCTT- 
10 3' TCGAGTAGTGTGrrrGlTrGITrrGTITlACTACGAAAACGTTCGGAAGGAAAAGGAA- 

- TTGGCTGGTTTTGCAGCCAAAATATCTGCCGCGGACACATATGCAGCTACGAGAT 3 ' 

- AACCGACCAAAACGTCGGTTTTATAGACGGCGCCTGTGTATACGTCGATGCTCTA 5 ' 

15 This fragment gives a correct junction of the GAL7 
promoter and the lipase gene with in between the 
sequence encoding the invertase signal sequence. 

In the case of ptTR6802: 

20 

11 • 

5 ' CATCACACAAACAAACAAAACAAAATGATG C 1 Tl ' T GCAAGCCTTCCTTTTCCT - 

3 ' TCGAGTAGTGTGTTTGTTTGTTTTGTTTTACTACGAAAACGTTCGGAAGGAAAAGGA- 

25 -TTTGGCTGGTTTTCCAGCGAAAATATCTGCCTCCGGTACTAACGAAACTTCTGATAA- 
- AAAGCG AGGAAAACGTCGGTTTTATAG ACGGAGGCCATG ATTGCTTTG AAGACTATT - 



-GAGATGAAGCGAAGCTGCTGACACATATGCACCTACGAGAT 3 • 

- CTCTCTTCGACTTCGACGACTGTGTATACGTCGATGCTCTA 5 ' 



30 



This fragment gives a correct junction of the GAI/7 
promoter and the lipase gene with in between the 
sequences encoding a KEX2 cleavage site, a glycosylation 
site and the invertase signal sequence. 

35 

d. The SacI-£UndIII vector fragment, one of the Sacl- 
£coRV synthetic fragments (I) and the EcoRV- 
ffiudlll 0&A fragment with the lipase gene were 
lig&ted. For the construction of pUR6801 this is 

40 shown in Fig. 9. (pUR6802 is constructed in the 

same way, using synthetic fragment II) 

e. The ligation mixture was transformed to E. coll. 
From single^ colonies, after cultivation, the 
plasmid DNA was isolated and the correct plasmids, 

45 as judged by restriction enzyme analysis, were 

selected and isolated in large amounts. 

f . The plasmids pUR6801 and pUR6802 were transformed 
to S. cerevisiae strain SU10 (21) using the 
spheroplast procedure (22) using selection on the 

50 presence of the IiEU2d gene product. 

g. The transf ormants were grown overnight in defined 
medium ( 0,68% Yeast Nitrogen Base w/o amino acids, 
2% glucose, histidine and uracil) , diluted 1 : 10 
in induction medium (1% yeast extract, 2% bacto- 

55 peptone, 5% galactose) and grown for 4 0 

hours . 
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h- The cells were isolated by centrif ligation and cell 

extracts were prepared (23) . 
i. The cell extracts were analysed by SDS-gel- 

electrophoresis (7) and blotted on nitrocellulose. 
5 j . The nitrocellulose blots were incubated with lipase 

antibodies and subsequently with 1125 labelled 

protein A followed by fluorography (Fig. 10). 

As shown in Fig. 10, SU10 cells containing the plasmid 
10 pUR6801 produce lipase enzyme with the correct molecular 
weight as compared to lipase from p. glumae. In addition 
to the correct protein also not processed and glycosy- 
lated lipase protein can also be seen. The P. glumae 
lipase produced by cerevisiae is enzymatically 
15 active. 



EXAMPLE 7. PRODUCTION OF P. GLUMAE LIPASE BY 
CEREV1SIAE USING MULTICOPY INTEGRATION. 

20 

The multi-copy integration vector was derived from the 
plasmid pARESG (24) by replacing the 335 bp yeast RNA 
polymerase I promoter element with the 4.5 B^III B 
fragment of 5. cerevx&iae rDNA (25). Also the origin 

25 of replication was removed and the B^lII-ffindlll DNA 
fragment comprising chloroplast DNA from s. oligorhiza 
was replaced by a poly linker DNA sequence. This resulted 
in plasmid pUR2790 from which a detailed picture is 
shown ~n Fig. 11. 

30 The essential sequences for multicopy integration in the 
yeast genome of pUR2790 are: 1. rDNA sequences for 
multicopy integration in the yeast genome, 2. the s. 
cerevisiae LEU 2d gene (26) ; this is the LEU 2 gene with a 
deficient promoter. 

35 

Amongst others, the following multicopy integration 
expression plasmids were* constructed, encoding: 

1. mature lipase preceded by the invertase signal 
40 sequence (pUR6803) , 

2. mature lipase preceded by a KEX2 cleavage site, a 
glycosylation site and the invertase signal 
sequence (pUR6804) » 

In order to obtain the above mentioned constructs, the 
45 routes followed were (Fig. 12; the used restriction 
recognition sites are marked with an asterisk) : 

ad 1 and 2. 

50 a. The plasmid pUR2790 was partially digested with 
jyindlll. The linear plasmid was isolated d 
digested to completion with B^III and the ffindlll- 
Bgl II vector fragment was isolated by agarose gel- 
el ectrophores is and electro-elution. 

55 b. The plasmid pUR68 01 was digested partially with 
Bgl II and to completion with flindlll and the 
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BglTl-H indlll DNA fragment with the lipase gene was 
isolated (pUR6804 is constructed in the same way 
using plasmid pUR6802 instead of pUR6801) . 

c. The Bg 1TX-H in dJTl vector fragment of pUR2790 and 
the Bglll-JTindlll fragment with the lipase gene 
were ligated (Fig. 12), resulting in plasmid 
pUR6803 . 

d. The ligation mixture was transformed to E. coll. 
From single colonies, after cultivation, the 
plasmid DNA was isolated and the correct plasmids, 
pUR6803 and pUR6804, as judged by restriction 
enzyme analysis were selected and isolated in large 
ounts. 

e. The plasmids pUR6803 and pUR6804 were transformed 
to S. cerevisxae Strain YT6-2-1 L (26) = SU50 with 
the spheroplast procedure (22) using selecting for 
the presence of the LEU2d gene product. The host 
strain SU50 is deficient for the essential nutrient 
leucine (IiEU2) , which means that strain SU50 is not 
capable of producing leucine. Thus it can only grow 
when the growth medium contains sufficient amounts 
of leucine. 

The deficient promoter of the LEU2 gene present in 
vectors pUR6803 and pUR6804 is essential for 
multicop^ integration of the plasmid vectors in the 
yeast genome. The multicopy integration occurs at 
the rDNA locus of the yeast genome due to homolo- 
gous recombination of the rDNA sequences of the 
plasmids and the rDNA sequences of the yeast 
genome. 

f . The integrants were grown overnight in defined 
medium ( 0,68% Yeast Nitrogen Base w/o amino acids, 
2% glucose, histidine and uracil) , diluted 1 : 10 
in induction medium (1% yeast extract, 2% bacto- 
peptone, 5% galactose) and grown for 40 hours. 

g. The cell§ were isolated by centrifugation and cell 
extr acts , were prepared (23) . 

h. The cell extracts were analysed by SDS-gel- 
electrophoresis (7) and blotted to nitrocellulose 
filters. 

i. The nitrocellulose blots were incubated with lipase 
antibodies and subsequently with 1125 labelled 
protein A followed by fluorography (Fig. 13). 

As shown in Fig, 13, integrants of SU50 with the plasmid 
pUR6803 produce lipase enzyme with the correct molecular 
weight as compared to lipase from P . glumae. In addition 
to the correct protein, not processed and glycosylated 
lipase protein can also be seen. The p. glumae lipase 
produced by s. cerevlsiae is enzymatically active. 

In this way yeast strains have been obtained carrying 
multiple integrated copies (up to 100 copies per haploid 
genome) of eitier the plasmid pUR6803 or pUR6804 
(including the lipase expression cassette) for the 
production of active p. glumae lipase. This multicopy 
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integration system is stable even under non-selective 
conditions . 



5 EXAMPLE 8. PRODUCTION OF GUAR a-GALACTOSIDASE 

IN SACCEAROHYCES CEREVISIAE USING MULTICOPY INTEGRATION 

In this example the expression of a heterologous 
protein, a-galactosidase from guar {Cyamopsis tetrago- 

10 no lob a) in Sac charomyces cerevis xae r using multicopy 
integration , is described- The gene encoding guar a- 
galactosidase was fused to homologous expression 
signals as is described by Overbeeke (21) resulting in 
the expression vector pUR273 0. The a-galactosidase 

15 expression cassette of pUR2730 consists of the s. 

cerev-lside GAL7 promoter, the 5. cerevisiae invertase 
signal sequence and the a-galactosidase gene encoding 
mature a-galactosidase. The multicopy integration vector 
used is pUR2770, which is identical to pMIRY2.1 (27), 

20 The a-galactosidase expression cassette was isolated and 
inserted in the multicopy integration vector pUR2770 
resulting in pUR2774. This multicopy integration vector 
contains the a-galactosidase expression vector, S. 
cerevisiae ribosomal DNA sequences and the s. 

25 cercvisiae deficient LEU2 gene (LEU2d) as a selection 
marker. The multicopy integration vector was 
transformed to 5. cerevisiae and multicopy integrants 
were obtained. The multicopy integrants were mitotically 
stable and the multicopy integrants expressed and 

3 0 secreted the plant protein a-galactosidase. This example 
clearly demonstrates that it is possible to obtain 
multicopy integration in the genome of s. cerevisxae and 
that the multicopy integrants can be used for the 
production of proteins. All DNA manipulations were 

35 carried out as described in Maniatis (7) . 

X. Construction of multicopy integration vector 
PUR2774. 

40 The multicopy integration vector pUR2770 was partially 
digested with Hindlll and the linearized vector fragment 
was isolated. The linear vector fragment was digested to 
completion with BamHI and the resulting 8 kb vector 
fragment was isolated. The a-galactosidase expression 

45 cassette was isolated from pUR2730 by digestion with 
JTindlll and B^III and isolation of the 1.9 kb DNA 
fragment. The a-galactosidase expression cassette was 
ligated in the isolated vector fragment of pUR2770 
resulting in the multicopy integration vector pUR2774 

50 (see also Fig. 14). The ligation mixture was transformed 
to E, oolL. From single colonies, after cultivation, the 
plasmid DNA was isolated and the correct plasmids, as 
-judged by restriction enzyme analysis, were selected and 
isolated in large amounts. The multicopy integration 

55 vector pUR2774, linearized with S ma I .was transformed to 
the S. cerevislae strain YT6-2-1 L (26) using the 
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spheroplast method (22) by selecting for the presence of 
the LEU2d gene product. 

2. Analysis of the integration pattern of the 
5 multicopy integrants - 

The ribosomal DNA of 5. cerevlsl&e is present in ± 150 
identical copies of the rDNA unit, comprising the genes 
that specify the 17S, 5.8S and 26S rRNA components of 
the ribosomes of S. cerevisiae These rDNA units are 

10 tandemly repeated in a large gene cluster on chromosome 
XII of 5* cerevlsiae. The complete sequence of the rDNA 
unit is known, the rDNA unit is 9.0 kB large and 
contains two Bglll sites (28, 29, 30). When chromosomal 
DNA isolated from S. cerevisiae is digested with Bglll, 

15 the rDNA gene cluster gives rise to two fragments, with 
a length of 4,5 kb. This gene organization is schemati- 
cally represented in Fig. 15A. The 4.5 kb band cor- 
responding to the ribosomal DNA fragments is detectable 
in the restriction pattern on an ethidium-bromide 

20 stained agarose gel, because of the large number of 
ribosomal DNA units present in a haploid genome. The 
plasmid pUR2774 has a length of 9.8 kb and contains one 
single B^III restriction enzyme recognition site. If 
plasmid pUR2774 is tandemly integrated in a high copy- 

25 number, digestion of the chromosomal DNA with BglXX will 
give rise to a 9.8 kb DNA fragment. With an ethidium- 
bromide stained agarose gel a comparison can be made 
between the intensity of the 4.5 kb DNA band, cor- 
responding to + 150 copies of the ribosomal DNA unit, 

3 0 and the 9.8 kb DNA band, derived from the integrated 

plasmid. This gene cluster organization is shown in Fig. 
15B. This comparison will give a reasonable estimation 
of the number of integrated pUR2774 plasmids. pUR2774 
was linearized and transformed to the yeast strain YT6- 

35 2-1 L (SU50) which is LEU2~. Trans formants were streaked 
on MM (defined) -medium without leucine for an extra check 
for the LEU2 + phenotype. To examine whether integration 
of the multi-copy vector pUR2774 actually occurred, 
chromosomal DNA was isolated from independent integrants 

40 SU50A, SU50B, SU50C and SU50D. The total DNA was 

digested with S^lII and analyzed by gel -electrophoresis. 
An example of such a ethidium-bromide stained gel is 
shown in Fig. 16. As expected, in the restriction 
patterns of integrants SU50B and SU50C, two main bands 

45 can be distinguished at 4.5 and 9.8 kb. The parent 

strain gives to a single band only of 4.5 kb; the rDNA 
unit. So, we can conclude that in addition to the 
multiple ribosomal DNA units, these integrants surpris- 
ingly also contain multiple integrated copies of the 9.8 

50 kb plasmid pUR2774. Different multicopy integrants were 
found to contain different copy-numbers of the plasmid 
pUR2774 varying from 10 to 100. To confirm the presence 
of the et-galactpsidase gene, hybridization with radio- 
labelled probe was performed. The probe for the a- 

55 galactosidase gene was isolated from pUR2731, a pUR2730 
derivative, by digestion with FvuII and ffindlll and 
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isolation of the 1-4 kb fragment containing the a- 
galactosidase gene. To identify the rDNA sequences a 
probe was prepared by digestion of pUR277 0 with Smal 
and Hindlll, followed by isolation and labelling of the 
5 2.0 kb fragment. Upon hybridisation with the a- 

galactosidase probe (see Fig. 17) it was found that the 
9.8 kb band as detected in the ethidium bromide stained 
gel indeed corresponds to the a-galactosidase gene since 
this single band was present in the autoradiographs, 

10 while in the lane containing digested total DNA from the 
YT6-2-1 L parent strain no hybridisation signals were 
detected. Since we could detect the 9.8 kb band no 
extensive re-arrangements and/or deletions can have 
occurred in the integration process. Hybridisation with 

15 the rDNA probe resulted in signals corresponding to a 
4.5 kb band and a 9.8 kb band, from which it follows 
that indeed the 4.5 kb band contains the expected 
ribosomal DNA sequences. As proven with the cr- 
galactosidase probe the 9.8 kb band results from the 

20 multicopy integration of pUR2774. This 9.8- kb DNA band 
also gives a positive signal with the rDNA probe, 
because pUR2774 also contains ribosomal DNA sequences. 
From the results shown in Fig. 18 # assuming that the 4.5 
kb ribosomal DNA band represents 150 copies of the rDNA, 

25 the 9.8 kB band containing pUR2774 can be estimated to 
contain 50-100 copies. Thus, by transformation of the 
multi-copy integration plasmid pUR2774, it is indeed 
possible to direct 50-100 copies per cell of the <*- 
galactosidase expression cassette to the genome of s. 

3 0 cerevisiae. 

3. Production of a-galactosidase by multicopy 
integrants . 

35 Multicopy integrants SU50A, SU50B, STJ50C and SU50D 

chosen for having high copy numbers of the integrated 
plasmid were examined for a-galactosidase activity by 
growing them on 0.67% Yeast Nitrogen Base w/o amino 
acids, 2% glucose overnight, followed by induction of 

40 the GAI/7 promoter by a 1:10 dilution in 1% Yeast 

Extract, 2% Bacto-peptone , 2% galactose (YPGal) . The a- 
galactosidase activity in the supernatants of the 
cultures was determined by means of an enzyme activity 
test, as described by Overbeeke et al. (21), at 24 and 

45 48 hours after start of the induction. The results are 
shown in the following table: 
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24 hours induction 48 hours induction 

Integrant OD660 a -gal OD660 a -gal 

5 mg/1 mg/1 



SU50A 2%gal . 


. 9 


58 


8 


82 


10 SU50B 2%gal 


• 15 


101 


13 


235 


SU50C 2%gal 


6 


45 


4 


101 


SU50D 2%gal 


14 


41 


10 


54 



15 

The results shown in the table clearly demonstrate that 
it is possible to obtain high levels of expression of a 
foreign gene using a multicopy integrant* Moreover, 
SU50B (235 mg/1) gives rise to a higher level of ce- 

20 galactosadase production as compared to a expression 
system with extrachromosomal plasmids (see pUR2730 in 
reference 21) - In spite of the fact that all four multi- 
copy integrants had been elected for having a high copy- 
number of integrated <*-galactosidase expression 

25 cassettes, their expression levels vary from 54 to 23 5 
mg/1. 

4. Genetic stability of the SU50B multicopy integrant. 

30 To test whether integration of multiple copies a- 

galactosidase expression plasmids in the s. a&revisinc 
genome is genetically stable the complete test procedure 
was repeated under non-selective conditions. Integrant 
STJ50B was streaked on an YPD-agar plate and a pre- 

35 culture was inoculated and grown overnight in YPD at 30 
°C. Subsequen.^ly, the pre-culture was diluted 1:10, in 
YPGal* Samples were taken, optical density measured at 
660 rat ai)d. the er-galactosidase content of the culture- 
broth was determined by the enzyme activity assay. 

4 0 Surprisingly, "the expression level of <x-galactosidase 
was stable during the whole experiment. This experiment 
shows that indeed the multiple integrated expression 
plasmids are maintained very stable under non-selective 
conditions for inany generations. Another important 

4 5 finding was that the multi-copy integrants were stable 
for months on non-selective YPD-agar-plates kept at 4 
°C. When the pre-culture of the SD50B integrant is 
diluted 1:1000 in YP with 2% galactose, grown at 30 °C 
to an identical OD 660 nm, the or-galactosidase expres- 

50 sion is 250 mg/1. In this experiment the pre-culture of 
the multicopy integrant SU50B is diluted to a larger 
extend in YPGal, and the cells in the induced culture 
have to make more divisions before the same biomass and 
related to this the <i~galactosidase production is 

55 achieved as with a 1:10 dilution. Thus, we can conclude 
that the stability of the a-galactosidase production and 
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therefor the genetic stability of the multicopy 
integrants is very good compared to the stability of 
extrachromosomal plasmids under non-selective condi- 
tions. 

5 We have also found that for multicopy integration and 
genetic stability of the multicopy integrants the length 
of the multicopy integration vector is an important 
parameter. The use of multicopy integration vectors with 
a length of about 12 kb have a tendency to result in a 

10 lower copy number of integrated vectors in the genome 
and also a decreased genetic stability although still 
very reasonable. The use of relatively small multicopy 
integration vectors (± 3 kb) results in a high copy 
number of integrated vectors but with a decreased 

15 genetic stability. These results show that the optimal 
length for a multicopy integration vector, resulting in 
high copy number of integrated vectors and good genetic 
stability, is approximately the length of a single 
ribosomal DNA unit; for s. cerevxslae about 9 kb. 

20 

This example clearly demonstrates the feasibility of the 
use of multicopy integration in s. cerevisxae for the 
production of proteins* The high genetic stability of 
the multicopy integrant confer an important advantage as 

25 compared to the extrachromosomal plasmid-system where 
cells have to be grown under selective pressure. The 
multicopy integrants appeared to be very stable on YPD- 
agar plates as well as during growth in YPD- and YPGal 
culture medium for many generations. Considering the 

30 high level of expression of the a-galactosidase enzyme 
and the good mitotic stability of the integrated «- 
galactosidase expression cassettes, this integrant- 
system is a realistic option for large-scale production 
of the a-galactosidase enzyme or any other protein. 



EXAMPLE 9 MULTICOPY INTEGRATION IN SACCRAROHYCES 

CEREVISIAE USING OTHER DEFICIENT SELECTION MARKERS. 

40 We have found that for multicopy integration in yeast 
there are two prerequisites for the multicopy integra- 
tion vector; the multicopy integration vector should 
contain ribosomal DNA sequences and a selection marker 
with a specific degree of deficiency. In the previous 

45 examples multicopy integration is obtained using a 
multicopy integration vector with ribosomal DNA 
sequences and the defective LEU2 gene (LEU2d) as a 
selection marker. In this example the use of other (than 
LEU2d) defective selection markers in order to obtain 

50 multicopy integration in yeast is described. In this 
example multicopy integration vectors are used with 
either a deficient TRP1 or a deficient URA3 instead of 
the LEU2d gene. The expression of both these genes was 
severely curtailed by removal of a significant part of 

55 their 5' flanking regions. Using these multicopy 

integration vectors, multicopy integrants were obtained 
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in which approximately 200 copies of the vector 
integrated. This example clearly demonstrates that 
multicopy .integration can also be effected by other 
deficient selection markers. All standard DNA manipula- 
5 tions were carried out as described in Maniatis (7) . 

1. Construction and analysis of pMIRY plasmids 

containing a deficient TRP1 gene as selection 
marker . 

10 

In order to test the possibility that multicopy 
integration into the genome can be obtained using 
different types of selection pressure during transforma- 
tion, series of pKIRY2 . 1-analogous plasmids (pMIRY2.1 is 

15 identical to pUR2770) were constructed, containing 
deficient .alleles of two genes commonly used as 
selection- markers: the TRPl and TOA3 genes of S. 
cerevisiae. The TRP1 gene encodes the enzyme 
N- (5 ' -phosphpribosyl~l) -anthranilate (PRA) isomerase , 

20 which catalyzes the third step in the biosynthesis of 
tryptophan (31) r Transcription of the TRPl gene is 
initiated at multiple sites which are organized into two 
clusters (Fig. 19) , one at about position -200 relative 
to the ATG start codon and the other just upstream of 

25 this codon (32)^ Each of the two clusters is preceded by 
putative TATA elements as well as (dA:dT)-rich regions 
that could act as promoter elements (3). When the 
upstream region of the TRP1 gene is deleted up to the 
EcoBX site at position -102 (TA1) , the first cluster of 

30 transcription start sites is removed and the expression 
level of the gene drops to only 20% to 25% of the value 
of its wild- type counterpart (31) . This particular 
deficient .TRPl allele is currently used as selection 
marker in several yeast vectors * We hypothesize that 

35 this degree of deficiency was not high enough and 

therefore the deletion in the 5 '-flanking sequence was 
extended to either position -30 (TA2) or -6 (TA3) 
upstream of the ATG codon- The Ta2 gene still contains 
part of the dowtistream cluster of transcription 

40 initiation sites. In the TA3 deletion mutant both 
clusters as welJL as all poly dA:dT stretches and 
putative TATA elements are deleted. These two mutant 
TRPl genes, as well as the original TA1 gene were used in 
construction of the pKIRY6-T series of plasmids. 

45 Construction of this series was carried out as follows 
(Fig. 20) : first, the 766 bp AccI-PstI (Fig. 19) 
fragment, containing the TRP1 coding region plus 3 0 bp 
of 5 '-flanking sequence, was cloned between the Saal and 
PstI sites of pUC19, resulting in plasmid pUC19-TA2 (the 

50 AccX site was made blunt by filling in the 3 '-end using 
T4 polymerase). Subsequently, the 3.5 kb SphX fragment 
from a pUC18 subclone containing the BglXl-B rDNA 
fragment (27) was inserted into the SphX site of the 
PUC19-TA2 fcolylinker giving plasmid pMIRY6-TA2. Plasmids 

55 pMIRY6-TAl and pKIRY6-TA3 are derivatives of pMIRY6-TA2. 
To obtain pMIRY.6-TAl, the 867 bp EcoRI-BgUI TRP1 



WO 91/00920 



PCT/EP90/01138 



35 



fragment was first cloned between the EcoRI and the 
BamWL sites of pUC19, giving plasmid pUC19-TAl. In the 
next step the 1.2 kb Scal-KcoRV fragment of pMIRY6-TA2 
which contains a portion of the pUC19 sequence as well 
5 as part of the TRP1 gene (Fig. 20) , was replaced by the 
1.3 kb 5caI-£coRV fragment from pUC19-TAl restoring the 
length of the 5' flanking sequence of the TRP1 gene to 
102 bp. pMIRY6-TA3 was constructed in a similar way. 
First, the 405 bp AlttX fragment from the TRP1 gene was 

10 cloned into the SmaX site of pUC19, giving pTJC19-TA3 . 
Subsequently, the 1.2 kb ScaI-£coRV fragment of 
PMIRY6-TA2 was replaced by the 1.2 kb ScaI-£coRV 
fragment from pUC19-TA3, to give pMIRY6-TA3. 
Plasmids, pMIRY6-TAl, pMIRY6-TA2 and pMIRY6-TA3 were 

15 transformed into yeast after linearization with SpaX 

within the rDNA sequence, in order to target integration 
to the rDNA locus. In Fig. 21 a gel electrophoretic 
analysis of total DNA is shown from two independently 
isolated trans formants of each type after digestion 

20 with EcoHV in the case of pMIRY6-TAl and Sad in the 

cases of pMIRY6-TA2 and pMIRY6-TA3. In the case of the 
PMIRY6-TA2 (lanes 3 and 4) and pMIRY6-TA3 (lanes 5 and 
6) transformants, the rDNA band and the plasmid bands 
are of comparable intensity. Thus, the copy number of 

25 each of the two plasmids is as high as the number of 
rDNA units per haploid genome, which is approximately 
150 (33) . The copy number of the pMIRY6-TA2 and -TA3 
plasmids is of the same order. In contrast, transforma- 
tion with pMIRY6 — TA 1 did not result in high-copy-number 

30 transformants. As shown in Fig. 21 (lanes 1 and 2) , no 
6.9 kb band corresponding to the linearized pMlRY6-TAl 
plasmid is visible upon Sac I digestion of the total DNA 
from pMIRY-TAl transformed cells. The results described 
above clearly demonstrate that multicopy integration 

35 into the yeast rDNA locus does not absolutely require 
the presence of the I*EU2d gene selection marker in the 
vector. Instead, deficient TRP1 alleles can be used, 
provided their expression falls below a critical level. 

40 2. Construction and analysis of a pMIRY plasmid with a 
deficient URA3 gene as selection marker. 

Next to the IiEU2 and the TRPl genes, the URA3 gene is 
one of the most widely used selection markers in yeast 

45 vectors (34) . The URA3 gene encodes orotidine-5' -phos- 
phate carboxylase (OMP decarboxylase) . The expression of 
this gene is controlled at the level of transcription by 
the PPR1 gene product which acts as a positive regulator 
(35) . Deletion analysis suggests that the sequence 

50 essential for PPR1 induction of URA3 is located in a 97 
bp long region located just upstream of the ATG 
translation start codon (36) . In order to obtain a 
promoter of the URA3 gene with the desired degree of 
deficiency we have deleted most of this region, using a 

55 PstI site located 16 bp upstream of the ATG start codon. 
To that end a .Bgljx linker was inserted in the Smal 
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site of pFLl (36B) located in the 3' flanking region of 
the URA3 gene at position +880 relative to the ATG 
translational start signal, yielding pFLl- Bglll. The 
0.9 Jcb Fstl-BglTX fragment, comprising the URA3 coding 
5 region together with its fianking 3' region abutted by 
the B^III site and only 16 bp of its 5' flanking region 
abutted by the PstT site, was cloned between the Pstl 
and BamKL sites of pUC19, yielding pUC19-U (Fig. 22). 
The 2.8 kb 5acl-5tul rDNA fragment containing part of 

10 the B^XII-B rDNA fragment, was isolated from pUOBR and 
inserted between the Soal and Sac I sites in pUC19~UA, 
giving plasmid.pMIRY7-UA. Copy number analysis of two 
independently isolated pMIRY7-UA transf ormants is shown 
in Fig. 23, The plasmid band and the rDNA band have 

15 similar intensities which means that the plasmid is 
integrated in ^bout 200 copies per cell, a result 
similar to that obtained with plasmids pMIRY6-TA2 and 
PMIRY6-TA3. This example clearly demonstrates that 
multicopy integration into the yeast ribosomal DNA locus 

20 is also effected using genes other than LEU2d as 

selection marker. Indeed, it seems likely that any gene 
involved in the: biosynthesis of a essential nutrient can 
support this process, when employed as selection marker 
in a pMIRY plasmid, provided that it is expressed but 

25 its expression is below a critical level. 

This means that surprisingly we have found that besides 
the ribosomal DNA sequence a deficient, but essential 
gene must be present on the multicopy integration vector 
in order to obtain multicopy integration, in a S. 

30 cerevlsiae strain deficient for that essential gene, of 
this multicopy integration vector and that the obtained 
multicopy integrants can be stable for many • 
generations. So the principle of multicopy integration 
can be extended to all S. cerevLsiae auxotrophic strains 

35 thus permitting 1 a choice from a range of host strains 

for the expression of any particular gene. Such a choice 
is an important factor in the optimization of 
heterologous gene expression in yeast. In particular Trp 
auxotrophy is ah attractive marker for use in an 

40 industrial process since even poorly defined media can 
easily be depleted of tryptophan by heat-sterilization. 



EXAMPLE 10 STABILITY OF THE MULTICOPY INTEGRANT SU50 
45 IN CONTINUOUS CULTURES . 

The multicopy integrant is cultivated in a continuous 
culture (chemostat) with a working volume of 800 ml at a 
dilution rate of 0.1 h" 1 (a mean residence time of 10 

50 hours) ♦ The integrant SU50B is a transf ormant of strain 
Saccharomyces cerevlslse CBS 235-90 with the multicopy 
integration vector pUR2774 (see example 8). The pH was 
controlled at 5.0 using 10% NH 4 0H. Foaming was sup- 
pressed using a silicon oil based antifoam (Rhodorsil 

55 4 26 R Rhone-Poulenc) The feed composition used was A. 
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A steady state was maintained for 12 0 hours with a 
stable expression of 360 mg/1 a-galactosidase at a 
biomass dry weight concentration of 11.06 g/1. Similar 
conditions were stable in other experiments for more 
5 than 500 hours. The residual glucose concentration was 
below the detection limit of 0.05 g/1. The residual 
galactose concentration was 4.2 +/- 0.1 g/1. The inlet 
contained 170 mg/1 leucine derived from the yeast 
extract and DHW. This resulted in a steady state leucine 

10 concentration of 2.0 +/- 0.4 mg/1 as determined with a 
amino acid analyzer. After a period of time, 50 mg/1 
leucine was added to the feed A. Surprisingly the 
residual leucine concentration in the culture dropped to 
0.7 +/- 0.2 mg/1. This was accompanied by a considerable 

15 decrease of a-galactosidase activity within 80 hours to 
144 mg/1 (Fig. 24) . In Fig. 25 the determination of the 
copy number during various stages of the experiments is 
shown, samples were taken, chromosomal DNA isolated, 
digested with Bglll and subsequently southern blotting 

20 was performed using the xibosomal DNA probe as described 
in example 8. SU50B 1 is a positive control grown in a 
shake flask. Clearly can be seen that the copy number of 
integrated vectors , by comparing the smaller hybridizing 
DNA fragment (chromosomal rDNA units: + 150 copies) 

25 with the larger hybridizing DNA fragment (the integrated 
vector), is about 1O0. This is the same for SU50B 2, a 
southern blot of a sample taken before the adding of the 
leucine. For SUB 0B 3, a sample taken after the adding of 
leucine and the drop in cr-galactosidase expression the 

3 0 copy number has decreased to about 10. This experiment 
shows that the decrease in a-galactosidase expression is 
accompanied by a decrease in copy number of the <*- 
galactosidase gene. The leucine uptake of the culture is 
higher after addition of leucine. 

35 The experiments described above show quite surprisingly 
that the genetic stability of the integrated plasmids is 
due to the fact that the intracellular production of 
leucine is required for growth, in spite of the presence 
of an appreciable amount of extracellular leucine. Due 

40 to the inefficiency of the LEU2d promoter, production of 
sufficient amounts of leucine is only possible when a 
large number of LEU2d genes is present on the chromo- 
some. 

Such a large number of integrated genes can be stably 
45 maintained when the integration site is in, or directly 
linked to the ribosomal DNA locus and under proper 
growth rate conditions and medium composition. 
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Media composition g/1 

Compound A B C D 

5 NH4C1 7.6 7.6 7.6 

KH2P04 2.8 4.0 4.0 

MgS04.7aq 0.6 0.6 0.6 

trace metals 10 10 

yeast extract 5 10 
10 (Difco)## 

peptone 0.0 0.0 20 

DHW (UF) 125 0.0 

glucose 5.5 20 20 

galactose 10 20 10 

15 histidine 0.05 0.2 0.2 

vitamin 2 1 l 
solution 

leucine 0.05 
(added) 

20 pH 5.0 5.0 5.0 5.0 

DHW: de-proteinized hydro lysed whey ex DMV Netherlands 
UF : ultra filtrated molecular weight cutoff 10 kD. 

## : yeast extract contains 8-9 %w/w leucine. 

25 



EXAMPLE 11 PARAMETERS AFFECTING THE STABILITY OF 
MULTICOPY INTEGRANT SU50B 

3 0 Strain SU50B (as described in example 10) was cultivated 
in shake flasks in media c and D. This gives, an example 
of two extreme media ranging from a complex, rich medium 
to a minimal medium. Medium C (YPGAL) contains 524 mg/1 
leucine. Surprisingly, the integrant was stable in YPGAL 

35 media for many sub-cultivations (see example 8) .In 
medium D and other minimal media with leucine the 
expression decreased rapidly. The residual concentration 
of leucine (derived from the yeast extract and peptone) 
in the medium C decreased from 524 mg/1 to 393 mg/1. The 

40 leucine concentration in medium D reduced from 50 to 

about 20. mg/1. The growth rate of the strain in minimal 
media is about 0.1 h~l while the growth rate on medium C 
is 0,27 h"l. Addition of yeast extract increases the 
growth rate up to 0.27 h~l combined with an improved 

45 stability of the er-galactosidase production. 

The complex media not only increase the growth rate, but 
also increase the a— galactosidase concentration in the 
culture . 



These experiments clearly show that the multicopy 
integrant was stable at high growth rates in the 
presence of leucine. Based on this finding an efficient 
fermentation process can be developed meaning a 
55 substantial amount of protein per culture volume per 
hour* can be obtained. 
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EXAMPLE 12 EXPRESSION OF THE SYNTHETIC LIPASE GENES 
IN HANSENULA POLYMORPH A- 



The synthetic lipase genes were integrated in the B . 
5 polymorphs, genome using the following procedure (Fig. 
26; in each figure of this example the used restriction 
enzyme recognition sites are marked with an asterisk; 
restriction recognition sites between brackets are 
removed due to the cloning procedure) : 

10 

a. PI asm id pUR6038 (Fig- 27) was digested to comple- 
tion with the restriction enzymes £coRI and £coRV. 
After separation of the fragments by agarose gel 
electrophoresis the vector fragment was isolated as 

15 described in Example 2. 

b. Several different synthetic cassettes were 
assembled as described in Example 3. These 
cassettes encoded a number of amino acids necessary 
for a correct joining of the invertase signal 

20 sequence with different length of the ~p re-mature 

lipase gene. This was done to establish the most 
optimal construct with respect to expression, 
processing and export of the lipase enzyme. 
Furthermore, these cassettes had £coRI and fcoRV 

25 ends. 

Typical examples are given in Fig. 26. 

c. The assembled cassettes were ligated in the vector 
prepared under a. 

d. The plasmids thus obtained (pUR6850, 6851 and 6852 
30 Fig. 28) were partially digested with the restric- 
tion enzyme xhol and the linearized plasmid was 
isolated • 

e. Plasmid pUR350l (21, Fig. 29) was partially 
digested with XhoX. After agarose gel electropho- 

35 res is a DNA fragment of approximately 1500 bp was 

isolated, containing the H- polymorpha methanol 
oxidase (MOX) promoter followed by the first amino 
acids of the s. cerevxsiae invertase signal 
sequence Xhol DNA fragment from position 0 to 1500 

40 from PTJR3501) . 

f. The 1.5 kb fragment from e. was ligated in the 
vector fragments as prepared in d resulting in 
plasmids UR6860, 6861 r 6862 Fig. 30. 

45 g. The ligation mixture was transformed to E. coll. 
From single colonies, after cultivation, the 
plasmid DNA was isolated and the correct plasmids, 
as judged by restriction enzyme analysis, were 
selected and isolated in large mounts. 

50 h. The correct plasmids obtained in step g. (eg. 

pUR6B60, 6861, 6862 Fig. 30) were digested to 
completion with BamUI, after which the sticky ends 
were filled in with Klenow polymerase (example 2). 
As the next step the linear plasmids were digested 

55 with £coRI, and the filled in BamHI-£coRI DNA 

fragments comprising the MOX promoter, invertase 
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signal sequence and synthetic lipase gene with a 
length of approximately 2.5 kb were isolated out of 
agarose gel. 

i. Plasmid pUR3511 (the R . polymorph* methanol oxidase 
5 (MOX) terminator cloned in the B&mUT, ffindl 

restriction sites of pEMBIi9, Fig. 31) was digested 
with Smal and fcoRl, after which the vector was 
isolated out of an agarose gel. 
j. The pUR3511 vector and the 2.5 kb fragments, 
10 obtained in h. , were ligated and cloned in E . coll. 

In the constructs obtained, the lipase gene is 
followed by the MOX transcription terminator. 
Typical examples of these constructs are pUR6870, 
6871 and 6872 (Fig. 32) . 
15 k. These plasmids were digested with EcoHl and 

Hindlll, after which the fragments of approximately 
3 kb. were isolated from an agarose gel. The sticky 
ends were filled in with Klenow polymerase. 
1. Plasmid pUR3513; this is plasmid YEpl3 (37) from 
20 which the 2/tm sequences have been deleted by 

removal of a Sail fragment (Fig. 33) was digested 
with fyn'SXm 

m. The line ear plasmid pUH3513 and the fragments 

obtained in k. were ligated to obtain the final 
25 constructs among which pUR6880, 6881 and 6882 (Fig. 

34). j 

Introduction oi the expression cassettes in the R* 
polymorphs, genome. 

30 

Transformation of plasmid DNA to the Hans caul* polymor- 
phs strain A16 using selection for IiEU* phenotype can be 
performed as described by (21, 38, 39). 
Analysis of . the integrants can be performed using the 
35 Southern blot procedure (7). 

*; ■ - 

EXAMPLE 13 PRODUCTION OF GUAR a -GAIACTOS IDAS E IN 
HANSENULA POLYMORPH A USING MULTICOPY INTEGRATION- 

40 

In this example the expression of a heterologous 
protein, a— galactosidase from guar Cyamops Is tetrago- 
nolob* ) , using multicopy integration in jfansenula 
polymorphs. , is described. The gene encoding a-galac- 

45 tosidase was fused to homologous expression signals as 
is described in" Overbeeke (21) resulting in the 
expression vector pUR3510. The t*-galactosidase expres- 
sion cassette of pUR3510 consists of the H- polymorpha 
methanol oxidase promoter, the S. cerevxs Ue invertase 

50 signal sequence, the a-galactosidase gene (encoding 

mature a^galactosidase) and the R . polymorpha methanol 
oxidase terminator. This expression cassette was 
isolated and inserted in the multicopy integration 
vector pUR279Q resulting in pUR3540. The multicopy 

55 integration vector. pUR3540 was transformed to fi, 

polymorphs and surprisingly multicopy integrants were 
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obtained. The obtained multicopy integrants expressed 
and secreted the plant protein a-galactosidase. This 
example clearly demonstrates that it is possible to 
obtain multicopy integrants in ff. polymorph* and these 
5 multicopy integrants can be used for the production of 
proteins. Also it appeared that multicopy integrants in 
E. polymorphs were obtained using S. cerevisiae 
ribosomal DNA sequences and a s. cerevisiae deficient 
selection marker. All DNA manipulations were carried out 
10 using standard techniques as described in Maniatis (7) • 

The plasmid pUR3510 (21) was digested with ffindlll and 
BamHI and the DNA fragment containing the a-galac- 
tosidase expression cassette was isolated. The multicopy 

15 integration vector pUR2790 is derived from pUR2740 by 
replacing the BglZI-ffindlTT 500 bp fragment containing 
5. oligorhiza DNA and a 100 bp S. cerevisiae ribosomal 
DNA by a B^III-Hlndlll poly linker sequence containing 
multiple cloning sites. The multicopy integration vector 

20 pUR2790 was partially digested with Bindlll and 

digested to completion with Bgl II and subsequently the 
vector fragment was isolated. The B^III-ffindlll vector 
fragment and the J? in dill -B amHX fragment, containing the 
a-galactosidase expression cassette, were ligated 

25 resulting in the multicopy integration vector pUR3540 
(see also Fig- 35? all used restriction recognition 
sites are marked with an asterisk) . The ligation mixture 
was transformed to E. coll . From single colonies, after 
cultivation, the plasmid DNA was isolated and the 

3 0 correct plasmids, as judged by restriction enzyme 

analysis, were selected and isolated in large amounts. 

The multicopy integration vector pUR3540 was linearized 
with Smal and the linearized vector pUR3540 was 

35 transformed to H. polymorphs A16 (LEU2*) using the 

procedure described by Roggenkamp et al (39) . The LEU2 + 
colonies, being the multicopy integrants, were isolated 
and used for further experiments. The multicopy 
integrant and the parent strain A16 as a control were 

40 grown under non-selective conditions (1% Yeast Extract, 
2% Bacto-peptone, 2% glucose for 40 hours at 37 °C) and 
chromosomal DNA was isolated as described by Janowicz et 
al. (40). The total DNA was digested with Jlindlll and 
the digested chromosomal DNA was analyzed by Southern 

45 hybridization (7) . An XhoX 878 bp fragment, containing 
a part of the methanol oxidase promoter [position -1313 
to position -435, Ledeboer (41)], was labelled with 3 2P 
and used as a probe. The result of this hybridization 
experiment is shown in Fig. 36, (lane 2 parent strain 

50 and lane 1 multicopy integrant). In lane 2, the parent 
strain, a DNA fragment of approximately 14 kb can be 
seen to hybridize with the methanol oxidase promoter 
probe, corresponding to a DNA fragment containing the 
entire methanol oxidase gene which is present in a 

55 single copy in the genome. In lane 1, the. .multicopy 
integrant, an additional hybridization signal was 
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obtained , corresponding to a DNA fragment of approximate- 
ly 8 kb. This fragment is part of the integrated vector 
pUR3540 being the Jlindlll fragment containing amongst 
others the et-galactosidase expression cassette and the 
5 methanol oxidase promoter- By comparison of the inten- 
sities of the hybridization signals in lane 2 it can be 
estimated that over 20 copies of the multicopy integra- 
tion vector are integrated in the ff. polymorphs genome. 
The multicopy integrant was analyzed for a-galactosidase 

10 expression as described by Overbeeke (21) - Upon 

induction with methanol or— galactosidase was detected in 
the medium . using the enzyme activity assay. 
This example clearly demonstrates that multicopy 
integration can be achieved in B . polymorphs and thus 

15 the multicopy ^integration system can be used for the 
production of (e.g. heterologous) proteins in H. 
polymorphs . This example also demonstrates that it is 
possible to obtain multicopy integration of an expres- 
sion vector in the genome of a yeast (e.g. H . polymor- 

20 pha) using the two prerequisites, ribosomal DNA 

sequences and a deficient selection marker. Such a 
selection marker can be homologous or originating from 
another host (e.g. S . cererisiae) as long as the 
expression level of the deficient gene is below a 

25 critical levelY 



EXAMPLE 14. TMULTICOPY INTEGRATION IN KLUYVEROMYCES . 

30 In this example a procedure is described to obtain 

multicopy integration of a plasmid vector in the genome 
of Kluyveromyces oarxianus var, lactls . Multicopy 
integration vectors were constructed containing 
ribosomal DNA sequences originating from s, cerevlsiae 

35 and deficient selection markers origination from the 
multicopy integration vectors (pMIRY6-TAl, pMIRY6-TA2 
and pMIRY6-TA3) . The multicopy integration vectors were 
transformed to" a TRP~ K. marzianus strain surprisingly 
resulting in trans form ants having multiple copies of the 

40 vector integrated in the genome of the Kluyveromyces 
strain. Also this example clearly demonstrates that 
multicopy integration can be obtained in yeasts using 
either homologous or heterologous deficient selection 
markers • 

45 

From the multicopy integration vectors pMIRY6-TAl, 
PMIRY6-TA2 and pMIRY6-TA3 (see example 9) the S. 
cerevislae ribosomal DNA was removed by digestion with 
SphI P isolation of the vector fragment followed by 

50 ligation of the vector fragment. In the resulting vector 
a 4400 bp EcoRI K. manianus ribosomal DNA fragment 
(42, Fig, 37) was" cloned in the EcoRI site resulting in 
the multicopy integration vectors pMIRK7ATl, pMIRK7AT2 
and pMIRK7AT3 (Fig. 38) . The multicopy integration 

55 vectors, after linearization with Sad, were 

transformed to the marxlanus strain MSK 110 (a, URA- 
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A, TRP1: :TJRA3) , (43) using the LiAc procedure (44). 
Trans formants were selected for TRP + phenotype. The 
obtained integrants were grown under non-selective 
conditions (0.67% Yeast Nitrogen Base with amino acids, 
5 2% glucose, 30 °C) , for 6-7, 30-35 and 60-70 genera- 
tions . This was performed by growing the integrants to 
OD 550 nm of 2 to 3, dilution in fresh non-selective 
medium to OD 550 nm of 0.1 and followed by growth to OD 
550 nm of 2 to 3. This cycle was repeated several times. 

10 From these integrants total DNA was isolated (45) , 

digested with PstI and separated on a 0.8% agarose gel, 
followed by Southern analysis using the EcoKZ-PstT 
fragment of the JC. 1 act: is ribosomal DNA (Fig. 37) as a 
probe, in Fig. 39 the result obtained with the integrant 

15 of pMIRK7ATl are shown. In lane 5 the hybridisation of 
the rDNA probe with the digested chromosomal DNA of the 
host strain is shown, the rDNA probe hybridizes with the 
± 150 repeated copies of the rDNA unit. In lane 1 the 
pMIRK7ATl integrant is shown. The hybridisation with 

20 rDNA copies, as for the parent strain, can- be seen but 
in addition the repeated integrated copies of the 
multicopy integration vector. As a control the hybri- 
disation result of the linearized multicopy integration 
vector with the rDNA probe (lane 6) is shown. The 

25 relative intensity of the hybridization signal can be 
used to estimate the copy number of integrated vector. 
The hybridisation signal with the rDNA units corresponds 
± 150 copies. Comparison of the intensity of hybridisa- 
tion signal of the integrated copies of the vector with 

3 0 the intensity of the hybridization signal with the rDNA 
units the copy number can estimated to be at least 50. 
This result shows that surprisingly multicopy integra- 
tion can also be obtained in the yeast genus KLuyvero- 
myces. In lane 2, 3 and 4 the integration pattern is 

35 shown after non-selective growth of the multicopy 

integrant, also used in lane 1, for 6-7, 30-3 5 and 60-70 
generations respectively. It can clearly be seen that 
the relative intensity of the hybridisation signals with 
the integrated vector does not decrease. This surprising 

40 finding proves that the multicopy integration is 

completely stable even after prolonged growth under non- 
selective conditions. Similar results were obtained 
using the multicopy integrants of pMIRK7AT2 and 
PMIRK7AT3 . 

45 This example clearly demonstrates that it is possible to 
obtain multicopy integration in Xluyveromyces using a 
multicopy integration vector with the two prerequisites 
ribosomal DNA sequences and a deficient selection 
marker, in this example even a heterologous selection 

50 marker. The multicopy integrants are stable for at least 
60 generation under non-selective conditions. By 
analogy with the examples 8 and 13, production of a 
protein in Kluyveromyces using multicopy integrants can 
be obtained by insertion of an expression cassette, with 

55 a gene coding for a protein of commercial interest, in 
the multicopy integration vector and transformation of 
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the resulting vector including the expression cassette* 
These multicopy integrants can be used for the produc- 
tion of the protein of commercial interest. Because of 
the unique properties of the multicopy integration 
5 system, high copy number and high genetic stability, 

these multicopy integration transf ormants can be used in 
any known fermentation production process for the 
production of a, commercially interesting, protein. 
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LEGENDS TO FIGURES 

In the figures of the different plasmids the order of 
length is given. 

5 

Figure 1 A schematic drawing of the plasmid 

pUR6002 comprising the P. glumae lipase gene- 
Figure 2 The complete nucleotide sequence of the 
10 P. glumae lipase gene; for details see text. 

Figure 3 A schematic drawing of the construction 

pUR6103 ; for details see text. 

15 Figure 4 A schematic drawing of the construction 

of the plasmids pUR6107 and pUR6108; for details see 
text. 

Figure 5 

20 A. The complete nucleotide sequence of the 

synthetic lipase gene in pUR6038. 

B. The nucleotide sequence of the 3' 

flanking region of the synthetic lipase gene in pUR6600. 

25 Figure 6 An example of the construction of a 

cassette in the synthetic lipase gene. 

Figure 7 A schematic drawing of the construction 

of plasmid pUR6131; for details see text. 

30 

Figure 8 An example of the improved resistance of 

a mutant lipase in a detergent system. 

Figure 9 A schematic drawing of the construction 

35 of the plasmid pUR6801. Plasmid pUR6801 is a s. 
cerevislAB/E . coll shuttle vector comprising the 
synthetic lipase gene with yeast expression- and 
secretion sequences. 

40 Figure 10 A western analysis of lipase expression 

in 5. cerevlslae using pUR6801. The corresponding blot 
was incubated with lipase specific antibodies. 
Standards: 1 /xg and 0.25 /ig P. glumae lipase. 

45 SU10: total intra-cellular protein of the 

host strain SU10. 
TF17 cells: total intra-cellular protein of SU10 

transformed with pUR6801. 
TF17 supernatant: total extracellular protein of SU10 
50 transformed with pUR680l. 

Figure 11 A schematic drawing of the multicopy 

integration vector pUR2790* 
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Figure 12 A schematic drawing of the construction 

of pUR6803. Plasmid pUR6803 is a multicopy integration 
vector comprising the lipase expression cassette. 

5 Figure 13 A western analysis of lipase expression 

of multicopy integrants. The multicopy integrants were 
obtained by transforming S. cerevisiae strain SU50 with 
the multicopy integration vector pUR6803; 7 independent 
multicopy integrants are shown. The corresponding blot 
10 was incubated with lipase specific antibodies. 
Standards: 1 pg and 0.25 j*g P. gl uaae lipase* 

SU50: total intracellular protein of host strain 

SU50. 

15 1-7: total intra-cellular protein of 7 independent 
multicopy integrants* 

Figure 14 A schematic drawing of the multicopy 

integration vector pTJR2774 comprising the a-galacto- 
20 sidase expression cassette. 

Figure 15 

A, A schematic drawing of the genetic orga- 

nization of the ribosomal DNA locus of S. cerevisiae • 
25 B. A schematic drawing of the genetic 

organization of a multicopy integration of pUR2774 in 
the ribosomal DMA locus of S. cerevisiae (multicopy 
integrant SU50B) • 

30 Figure 16 Ethidium bromide stained agarose gel of 

undigested and B^ZII digested total DNA of the multicopy 
integrants SU50B and SU50C. 

Figure 17 Southern blot of total DNA of multicopy 

35 integrants using the a-galactosidase probe. 

SU50 * JJ^III: parent strain YT6-2-1 L (SU50) total DNA 

digested with BglXI. 
C * BglXXi total DNA of multicopy integrant SU50C 

40 digested with Bgl II. 

B * BglXXz total DNA of multicopy integrant SU50B 

digested with fig-Ill. 
C: undigested total DNA of multicopy 

integrant SU50C. 
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Figure 18 Southern blot of multicopy integrants 

using the ribosomal DNA probe. 



SU50 * BglXXz parent strain YT6-2-1 Jj (SU50) total DNA 
50 digested with BglXX. 

C * B^III: total DNA of multicopy integrant SU50C 

digested with BglXl. 
B * B^III: total DNA of multicopy integrant SU50B 

digested with Bglll. 
55 C: undigested total DNA of multicopy 

integrant SU50C. 
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Figure 19 Structure of the TOPI gene from (32) * 

[AT] : poly(dAtdT) stretch, (UAS) , partial general 
control upstream activation site- The actual sequence is 
indicated for the putative TATA elements. The TRP1 
5 coding sequence is indicated by the black bar - The 
various mRNA species are indicated by the arrows. The 
scale is in base pairs. The restriction sites used to 
construct the promoter deletions are indicated. 

10 Figure 20 Construction of plasmids pMIRY6-T±l, 

pMUlY6-T±2 an4 pMIRY6-T±3 containing TRP1 alleles with 
various promoter deletions. The coordinates indicated 
for several of the restriction sites show their position 
with respect to the ATG start codon (the A being 

15 position +1) . For each plasmid the position (-6/ -30 or 
-102) of the 5 '-end of the TRP1 gene is indicated. A 
more detailed map of the rDNA fragment present in the 
various pMIRY6 plasmids is shown at top right. The 
non-transcribed rDNA spacer is abbreviated as "N" . 

20 

Figure 21 Plasmid copy number of pMIRY6-TAl (lanes 

1 and 2) r pMIRY 6— TA2 (lanes 3 and 4) and pMTRY6-TA3 
(lanes 5 and 6) transformants. Total DNA was isolated 
from the transformed cells and digested with £coRV in 
25 the case of pHTR Y 6 — T A 1 and s&cX in the case of 

pMIRY6-TA2 and pMIRY6-TA3. The fragments were separated 
by electrophoresis on an 0.8% gel. The DNA was stained 
with EtBr. The plasmid and the rDNA bands are indicated. 

30 Figure 22 Construction of pMIRY7-UA containing a 

URA3 gene in which most of the promoter has been 
deleted. The coordinates indicated for several of the 
restriction sites refer to their positions with respect 
to the ATG start codon (the A being position +1). The 

35 position of the 5 '-end of the URA3± is indicated (A16) . 
A more detailed map of the rDNA fragment present in 
pMIRY7-UA is shown at top right. The non-transcribed 
spacer is abbreviated as "N". 

40 Figure 23 Plasmid copy number of pMIRY7-UA 

transf ormants . Total DNA was isolated from the 
transformed cells and digested with Sad. The fragments 
were separated by electrophoresis on an 0.8% gel. The 
9.1 kb rDNA band and the 6.4 kb plasmid band are 

45 indicated. 

Figure 24 * Stability of multicopy integrant SU50B in 
continuous culture; for details see text. 

50 Figure 25 Southern blot of total DNA digested with 

BglXX of multicopy integrant SU50B isolated at different 
stages of thdi continuous culture. 

SU50B grown in shake flask. 

SU50B at the start of the continuous culture. 
SU50B after the addition of leucine. 



SU50B 1: 
55 SU50B 2: 
SU50B 3: 
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Figure 2 6 A schematic drawing of the construction 

route of the lipase expression vectors for H. polymorpha 
pUR6880, pUR6881 and pUR6882. Each individual stage of 
the construction route is shown in a separate drawing 
5 (Fig. 27 to 34) ; for details see text. 



Figure 


27 


A 


schematic drawing 


of 


PUR6038 . 


Figure 


28 


A 


schematic drawing 


of 


PUR6852. 


Figure 


29. 


A 


schematic, drawing 


of 


PHR35Q1. 


Figure 


30 


A 


schematic drawing 


of 


pUR6862. 


Figure 


31 


■A 


schematic drawing 


of 


PUR3511. 


Figure 


32 


A 


schematic drawing 


of 


pUR6872 . 


Figure 


33 


A 


schematic drawing 


of 


pTJR3513. 


Figure 


34 


A 


schematic drawing 


of 


pUR6882. 


Figure 


35 


A 


schematic drawing 


of 


the R . polymorpha 



multicopy integration vector pUR3540 comprising the a- 
25 galactosidase expression cassette. All used restriction 
recognition enzyme sites are marked with an asterisk. 

Figure 36 Southern analysis of total DNA digested 

with J?indIII of the H . polymorphs, multicopy integrant 
30 obtained using pUR3540. 

lane 1: multicopy integrant. 

lane 2: untrans formed host strain. 

35 Figure 37 The cloned ribosomal DNA of jc. inctis is 

shown (42) . From this vector the indicated BamHI-SacI 
fragment was subcloned' in pTZl9U (46) . From the 
resulting vector the EcoRI fragment was used in the 
construction of pMIRK7ATl, pMIRK7AT2 and pMIRK7AT3 . The 

40 EcoRl-PstT fragment was used as a probe in the 
hybridization experiments. 

Figure 38 A schematic drawing of the multicopy 

integration vectors pMIRK7ATl, pMIRK7AT2 and pM!RK7AT3. 

45 

Figure 39 - Hybridization of digested chromosomal DNA 
of multicopy integrant after growth under non-selective 
conditions with ribosomal DNA probe. Lane 1- 4: 
multicopy integrant MIRK7AT1; lane 5: parent strain MSK 
50 110, lane 6: linearized multicopy integration vector 

PMIRK7AT1. Chromosomal DNA was isolated at the start of 
the experiment (lane 1) , after 6-7 generations (lane 2), 
after 3 0-3 5 generations (lane 3) and after 60-70 
generations (lane 4). 
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CLAIMS 

1- Process for preparing a, homologous or 

heterologous, protein by a eukaryote transformed by 
multicopy integration of an expression vector into the 
genome of a host eukaryote, said expression vector 
containing both an "expressible gene" as herein before 
defined encoding said homologous or heterologous protein 
and a so-called "deficient selection marker needed for 
the growth of the yeast or mould in a specific medium" 
as herein before defined, wherein said expression vector 
contains ribosomal DNA sequences enabling multicopy 
integration of said expression vector in the ribosomal 
DNA locus of the eukaryote genome. 

2. Process according to claim 1, in which said 
deficient selection marker is selected from the group 
consisting of a LEU2& gene, a TRFld gene, and a URA3d 
gene. 

3. Process according to claim 1, in which the 
eukaryote is a fungus. 

4 . Process according to claim 3 , in which the 
fungus is a yeast, preferably selected from the group 
consisting Of the genera Saccharomyces , Kluyveromyccs 
and H&nsenula or a mould, preferably selected from the 
group consisting of the genera Aspergillus , Rhizopus and 
T r ichod erma . 

5. Process according to claim 1, in which said 
transformed eukaryote is grown in a medium containing an 
ingredient, which is essential for the growth of the 
eukaryote, at a concentration whereby the uptake of 
said ingredient is rate-limiting, so that de novo 
synthesis of said ingredient is required for a growth- 
rate above a certain minimum value which value depends 
on the host organism and the process conditions. 

6. Process according to claim 5, in which the 
transformed eukaryote is grown in a so-called "complete" 
or non-selective medium, which contains all the 
ingredients necessary for growth of the eukaryote. 

7. Process according to claim 6, in which the 
complete medium is an industrially applied growth 
medium, preferably one selected from the group consist- 
ing of molasses, whey, yeast extract and mixtures 
thereof. 
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8. A process according to claim 1 in which the 
transformed eukaryote contains the gene or genes 
required for expression of said protein in a multimeric 
form in one of its chromosomes in f or directly linked 
to, a locus coding for a ribosomal RNA while at the same 
locus also multimeric copies of a deficient gene 
encoding a protein required in the biochemical pathway 
for the synthesis of said "essential nutrient* 1 are 
present . 

9. A process according to claim 8, in which the 
expressible gene encodes an enzyme, preferably a 
hydrolytic enzyme, in particular a lipase, or a 
genetically modified form of such enzyme. 

10. A process according to claim 9, in which the 
lipase is selected from the group of lipases that cross- 
reacts with antisera raised against a lipase from 
Chromobacter vXscosum var lipolyticum NRRL B— 3673, 
lipases that cross-reacts with antisera raised against 
lipase from Alcallgenes PL-679, ATCC 31371 or FERM-P 

3783, lipases that cross-reacts with antisera raised 
against a lipase from Pseudomonas fluorescens I AM 1057, 
and modified forms of such cross-reacting lipase. 

11. A process according to claim 9, in which the 
lipase is encoded by a gene having the nucleotide 
sequence given in Figure 2 or any nucleotide sequence 
encoding the same amino acid sequence as specified by 
that nucleotide sequence or encoding modified forms of 
this amino acid sequence resulting in a lipase with a 
better overall performance in detergents systems than 
the original lipase. 

12. A process according to claim 1, in which the 
eukaryote is deficient for the synthesis of an 
"essential nutrient" as herein before defined and 
whereby the deficient selection marker can contribute to 
complementation of the synthesis of the "essential 
nutrient" . 

13 . a process according to claim 12 , in which the 
deficiency of the parent strain is achieved by replace- 
ment of a gene coding for an enzyme effective in the 
biosynthetic pathway of producing said essential 
nutrient. 

14. A process according to claim 13, in which the 
enzyme., for which the parent strain is deficient, cata- 
lyses a reaction in a part of the biosynthetic pathway 
that is not branched until the essential nutrient is 
formed . 
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15. A process according to claim 12, in which the 
essential nutrient is an amino acid, a nucleotide or a 
vitamin, in particular one of the amino acids leucine, 
tryptophan or uracil. 

16. A process according to claim 1, in which the 
expression vector contains 

(i) a ds ribosomal DNA or part thereof e.g. a ds DNA 
sequence that codes for a ribosomal RNA, and 

(ii) a DNA sequence containing in the 5' — > 3' direction 
in the following order: 

(ii) (a) a powerful promoter operable in the host 
organism, 

(ii) (b) optionally a signal sequence facilitating the 
secretion of said protein from the host 
eukaryote , 

(ii) (c) a structural gene encoding the protein, 
(ii) (d) an efficient terminator operable in the host 
eukaryote , 

in addition to the sequences normally present in a 
vector. 

17. A process according to claim 16, characterized 
in that the ribosomal DNA is selected from the group 
consisting of ribosomal DNA's occurring in moulds, in 
particular • moulds of the genera Aspergillus , Rhxzapus 
and Trlchoderma , and yeasts, in particular yeasts of the 
genera Saccharomyces , Kluyveromyces , Han senilis, and 
Pichia* 

18. A process according to claim 16, in which the 
vector has approximately the same length as * one 
ribosomal DNA unit of the host organism. 

19. A process according to claim 16, in which a 
promoter is selected from the group consisting of 

(i) the Gal7 promoter," the GAPDH promoter, or the PGK 
promoter, if the host belongs to the genus 
Saccharomyces , 

(ii) the inulinase promoter, the PGK promoter or the 
XAC4 promoter, if the host belongs to the genus 
Kluyv&romyces , 

(iii) the DHAS promoter or MOX promoter, if the host 
belongs to the genus Hansenula , 

(iv) the ' glucoamylase promoter, glucose-oxidase promoter 
or the GAPDH promoter, if the host belongs to a 
mould of the genus Aspergillus , and 

(v) the cellulase promoter or the GAPDH promoter, if 
the host belongs to moulds of the genera Rhizopus 
and Trichodcrma . 



20. A process according to claim 19, in which the 

protein is an oxidase and the host cell belongs to the 
genera Sa.nsen.ula. or Plchia or Aspergillus * 
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21. A process according to claim 16 , in which the 
structural gene encoding the protein encodes the light 
or heavy chain of an immuno-globulin or preferably both 
genes, or part of the light or heavy chain of an 
immunoglobulin, preferably that part coding for what 
normally is called FAB fragment, or that part thereof 
that codes for the variable regions. 

22. A process according to claim 21, in which the 
gene or genes are modified by genetic engineering 
resulting in modified immunoglobulins or immunoglobulins 
with catalytic activity (Abzymes) . 

23. A process according to claim 16, in which the 
expression vector additionally contains a deficient 
gene coding for an enzyme that has been disrupted or 
deleted from the chromosome of the host cell. 

24- A process according to claim 23, in which the 

deficient gene encodes an enzyme effective in the 
biosynthetic pathway of producing an essential nutrient, 
such as an amino acid like leucine, tryptophan or 
uracil, a nucleotide or a vitamin. 

25. A process according to claim 1, in which the 
process is selected from the group consisting of a 
normal batch fermentation, a fed-batch fermentation, and 
a continuous fermentation. 

26. A process according to claims 25, charac- 
terized in that the medium contains the essential 
nutrient in such a concentration that at least 20, 
preferably at least 50, copies of the deficient gene are 
maintained in the chromosome, said deficient gene enco- 
ding an enzyme involved in the biosynthesis of that 
essential nutrient. 

27. A process according to claim 26, characterized 
in that the growth rate of the host is between 20 and 
100 %, preferably between 80 and 100 %, of the maximum 
growth rate of a similar host not deficient for said 
essential nutrient under the same fermentation condi- 
tions. 
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2fr9 

c 
1 

a 
I 

ATGCTCAGATCGATGCGTTCCAGGGTGGCGGCGAGGGCGGTGGCATGGGCGTTGGCGGTG 

1 + + 4- + -t- + 60 

TACCAGTCTAGCTACGCAAGGTCCCACCGCCGCTCCCGCCACCGTACCCGCAACCGCCAC 

-39MVRSMRSRVAARAVAWALAV 

ATGCCGCTGGCCGGCGCGGCCGGGTTGACGATGGCCGCGTCGCCCGCGGCCGTCGCGGCG 

61 + + + + + + 120 

TACGGCGACCGGCCGCGCCGGCCCAACTGCTACCGGCGCAGCGGGCGCCGGCAGCGCCGC 

MPLAGAAGLTMAASPAAVAA 

sig.seq. -1 t 1 

GACACCTACGCGGCGACGCGCTATCCGGTGATCCTCGTCCACGGCCTCGCGGGCACCGAC 

121 + + + -t- + + 180 

CTGTGGATGCGCCGCTGCGCGATAGGCCACTAGGAGCAGGTGCCGGAGCGCCCGTGGCTG 

DTYAATRYPVILVHGLAGTD 
mature lipase 

AAGTTCGCGAACGTGGTGGACTATTGGTACGGAATCCAGAGCGATCTGCAATCGCATGGC 

181 + +--- + + - + + 240 

TTCAAGCGCTTGCACCACCTGATAACCATGCCTTAGGTCTCGCTAGACGTTAGCGTACCG 

KFANVVDYWYGIQSDLQSHG 

GCGAAGGTGTACGTCGCGAATCTCTCGGGATTCCAGAGCGACGACGGGCCGAACGGCCGC 

241 + + + + + ---+ 300 

CGCTTCCACATGCAGCGCTTAGAGAGCCCTAAGGTCTCGCTGCTGCCCGGCTTGCCGGCG 

AKVYVANLS GFQSDDGPNGR 

P 
v 
u 

I 
I 

GGCGAGCAGCTGCTCGCCTACGTGAAGCAGGTGCTCGCGGCCACCGGCGCGACCAAGGTG 

301 - +- +- + + + + 360 

CCGCTCGTCGACGAGCGGATGCACTTCGTCCACGAGCGCCGGTGGCCGCGCTGGTTCCAC 

GEQLLAYVKQVLAATGATKV 

AACCTGATCGGCCACAGCCAGGGCGGCCTGACCTCGCGCTACGTCGCGGCCGTCGCGCCG 

351 + + + +- + + 420 

TTGGACTAGCCGGTGTCGGTCCCGCCGGACTGGAGCGCGATGCAGCGCCGGCAGCGCGGC 



*T T 



IGK5QGGLTSRYVAAVAP 



CAACTGGTGGCCTCGGTGACGACGATCGGCACGCCGCATCGCGGCTCCGAGTTCGCCGAC 

421 +- -5- -+ + ---+ 480 

GTTGACCACCGGAGCCACTGCTGCTAGCCGTGCGGCGTAGCGCCGAGGCTCAAGCGGCTG 

QLVASVTTIGTP.HRGSEFAD 
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Fig. 2(Cont 11 \ 

I 

TTCGTGCAGGACGTGCTGAAGACCGATCCGACCGCGCTCTCGTCGACGGTGATCGCCGCC 

4S1 h — + + + + 540 

AAGCACGTCCTGCACGACTTCTGGCTAGGCTGGCCCGAGAGCAGCTGCCACTACCCGCGG 

FVQDVLKTDPTGLSSTVIAA 

TTCGTCAACGTGTTCGGCACGCTCGTCAGCAGCTCGCACAACACCGACCAGGACGCGCTC 

541 + + + -+ + 600 

AAGCAGTTGCACAAGCCGTGCGAGCAGTCGTCGAGCGTGTTGTGGCTGGTCCTGCGCGAG 

FVNVFGTLVSS SHNTDQDAL 

GCGGCGCTGCGCACGCTCACCACCGCGCAGACCGCCACCTACAACCGGAACTTCCCGAGC 

601 -- -t- + + + -r + 660 

CGCCGCGACGCGTGCGAGTGGTGGCGCGTCTGGCGGTGGATGTTGGCCTTGAAGGGCTCG 

AALRTLTTAQTATYNRNFPS 

GCGGGCCTGGGCGCGCCCGGTTCGTGCCAGACGGGCGCCGCGACCGAAACCGTCGGCGGC 

661 + --+- + -i- + 720 

CGCCCGGACCCGCGCGGGCCAAGCACGGTCTGCCCGCGGCGCTGGCTTTGGCAGCCGCCG 

AGLGAPGSCQTGAATETV GG 

AGCCAGCACCTGCTCTATTCGTGGGGCGGCACCGCGATCCAGCCCACCTCCACCGTGCTC 

721 + +- + + + + 780 

TCGGTCGTGGACGAGATAAGCACCCCGCCGTGGCGCTAGGTCGGGTGGAGGTGGCACGAG 

SQHLLYSWGGTAIQPTSTVL 

GGCGTGACCGGCGCGACCGACACCAGCACCGGCACGCTCGACGTCGCGAACGTGACCGAC 

781 -h + + + + &uO 

CCGCACTGGCCGCGCTGGCTGTGGTCGTGGCCGTGCGAGCTGCAGCGCTTGCACTGGCTG 

GVTGATDTSTGTLDVANVTD - 

CCGTCCACGCTCGCGCTGCTCGCCACCGGCGCGGTGATGATCAATCGCGCCTCGGGGCAG 

841 + +- + + + 900 

GGCAGGTGCGAGCGCGACGAGCGGTGGCCGCGCCACTACTAGTTAGCGCGGAGCCCCGTC 

PSTLALLATGAVMINRASGQ 

P 
s 
c 
I 

AACGACGGGCTCGTCTCGCGCTGCAGCTCGCTGTTCGGGCAGGTGATCAGCACCAGCTAC 

901 + + ---+ ~ + 96C 

TTGCTGCCCGAGCAGAGCGCGACGTCGAGCGACAAGCCCGTCGAGTAGTCGTGGTCGATG 

NDGLVSRCSSLFGQVISTSY 
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Fig. 2(Conf2). 



V 
v 
u 
I 
I 

CACTGGAACCATCTCGACGAGATCAACCAGCTGCTCGGCGTGCGCGGCGCCAACGCGGAA 

961 + + + + +--- + 1020 

GTGACCTTGGTAGAGCTGCTCTAGTTGGTCGACGAGCCGCACGCGCCGCGGTTGCGCCTT 

HWNHLDEINQ-LL.GVRGANAE 

P 
s 
t 
I 

GATCCGGTCGCGGTGATCCGCACGCACGTGAACCGGCTCAAGCTGCAGGGCCTCTGA 

1021 + + + 1077 

CTAGGCCAGCGCCAGTAGGCGTGCGTGCACTTGGCCGAGTTCGACGTCGCGCACACT 

DPVAVIRTHVNRLKLQGV* 
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Fig. 5. 

A. DNA sequence of synthetic lipase gene in pUR6038 . 



E 

c C 

0 1 
R a 

1 I 

GAATTCATGGTTAGATCGATGCGTAGCCGTGTTGCAGCTAGAGCGGTCGCATGGGCATTA 

1 -f h 4- 4- 4- 4- 60 

CTTAAGTACCAATCTAGCTACGCATCGGCACAACGTCGATCTCGCCAGCGTACCCGTAA7 
MVRSMRSRVAARAVAWAL 

P 

s 

u 
I 

GCTGTTATGCCATTAG CTGGAG CAGCAGGATTAAC AATGG CTG CGTCTC CTG CAGCGGTC 

61 + + + + + + 120 

CGACAATACGGTAATCGACCTCGTCGTCCTAATTGTTACCGACGCAGAGGACGTCGCCAG 

AVttPLAGAAGLTMAASPAAV 

E 
c 
o 
R 
V 

GCTGCTGACACATATGCAGCTACGAGAtATCCTGTGATCTTGGTTCACGGTTTGGCAGGA 

121 -+ + + + --- + -4- 180 

CGACGACTGTGTATACGTCGATGCTCTATAGGACACTAGAACCAAGTGCCAAACCGTCCT 

AADTYAATRYPVILVHGLAG 

ACGGACAAGTTTGCCAATGTCGTTGACTACTGGTATGGTATCCAGTCAGATCTTCAATCT 

181 + + + + + + 240 

TGCCTGTTCAAACGGTTACAGCAACTGATGACCATACCATAGGTCAGTCTAGAAGTTAGA 

TDKFANVVDYWYGIQSDLQS 

CACGGAGCTAAGGTTTACGTGGCCAATTTGAGCGGATTCCAGTCTGACGATGGCCCAAAC 

241 4--- 4- 4- 4- + + 300 

GTGCCTCGATTCCAAATGCACCGGTTAAACTCGCCTAAGGTCAGACTGCTACCGGGTTTG 

HGAKVYVANLSGFQSDDGPS 
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l Fig. SICont 1). 

U 
I 
I 

GGCCGCGGTGAACAGCTGCTTGCGTACGTCAAACAAGTACTTGCAGCTACGGGAGCTACG 



301 ~r + + +- -r -f 360 

CCGGCGCCACTTGTCGACGAACGCATGCAGTTTGTTCATGAACGTCGATGCCCTCGATGC 

GRGEQLLAYVKQVLAATCAT 

AAGGTCAACTTGATCGGCCACTCTCAAGGTGGCCTTACATCTAGATACGTTGCTGCCGTG 
361 +• + + + + --+ 420 

ttcc agttg aactag c c ggtg ag agttc c accg g aatgtag atctatg c aacg a c g g c a c 

kvnli'ghsqggltsryvaav 

gctcctcagttggtcgccagcgttactacgatcggtacgcctcacagaggctctgagttc 

421 + +- + - - + 480 

CGAGGAGTCAACCAGCGGTCGCAATGATGCTAGCCATGCGGAGTGTCTCCGAGACTCAAG 



APQLVASVTTIGTPHRGSEF 

s 

a. 
1 
I 

GCTGACTTTGTGCAAGACGTCTTGAAGACTGACCCAACAGGACTTTCGTCGACGGTTATT 

481 + + + + r 540 

CGACTGAAACACGTTCTGCAGAACTTCTGACTGGGTTGTCCTGAAAGCAGCTGCCAATAA 

ADFVQDVLKTDPTGLSSTVI 

GCGGCTTTCGTTAACGTTTTCGGCACATTGGTTTCTAGCTCTCACAATACGGATCAGGAC 

541 + + + + + 600 

CGCCGAAAGCAATTGCAAAAGCCGTGTAACCAAAGATCGAGAGTGTTATGCCTAGTCCTG 

AAFVNVFGTLVSSSHNTDQD 

GCCCTTGCTGCATTGCGCACGCTTACAACGGGTCAGACTGCCACGTATAATAGAAACTTT 

601 + + + + + + 660 

CGGGAACGACGTAACGCGTGCGAATGTTGCCGAGTCTGACGGTGCATATTATCTTTGAAA 

ALAALRTLTTAQTATYNRNF 

CCAAGCGCTGGCTTGGGAGCTCCTGGTTCTTGTCAGACGGGCGCAGCTACAGAGACGGTT 

661 -+ + + + -t 720 

GGTTCGCGACCGAACCCTCGAGGACCAAGAACAGTCTGCCCGCGTCGATGTCTCTGCCAA 

PSAGLGAPGSCQTGAATETV 
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P 

v 
u 

I 
I 

GGAGGCTCTCAGCACTTGCTTTACAGCTGGGGAGGTACCGCAATTCAACCAACGTCTACT 

721 + + +- + + + 780 

CCTCCGAGAGTCGTGAACGAAATGTCGACCCCTCCATGGCGTTAAGTTGGTTGCAGATGA 

GGSQHLLYSWGGTAIQPTST 



GTGTTGGGCGTTACGGGAGCTACAGACACAAGCACTGGCACGCTTGATGTCGCGAATGTG 

781 + + + + t + 840 

CACAACCCGCAATGCCCTCGATGTCTGTGTTCGTGACCGTGCGAACTACAGCGCTTACAC 

VLGVTGATDTS tgtldvanv 



ACGGACCCTTCTACGTTGGCTCTTTTGGCTACGGGAGCGGTTATGATCAACCGTGCTrCT 

841 + + — h -r + + 900 

TGCCTGGGAAGATGCAACCGAGAAAACCGATGCCCTCGCCAATACTAGTTGGCACGAAGA 

TDP S TLALLATGAVMI NRAS 

X 
h 
o 
I 

GGACAGAATGACGGCCTTGTCTCGAGATGTAGCTCTTTGTTCGGCCAGGTTATTTCTACG 

901 + + + + -r + 960 

CCTGTCTTACTGCCGGAACAGAGCXCTACATCGAGAAACAAGCCGGTCCAATAAAGATGC 

GQNDGLVSRCSSLFGQVI ST 



TCTTACCACTGGAACCACTTGGATGAAATCAACCAACTTTTGGGTGTGAGAGGCGCCAAT 

961 + + + + + + 1020 

AGAATGGTGACCTTGGTGAACCTACTTTAGTTGGTTGAAAACCCACACTCTCCGCGGTTA 

SYHWNHLDEINQLLGVRGAN 



GCTGAGGACn-fGTTGCCGTCATTCGTACGCACGTCAACAGATTGAAACTTCAGGGAGTT 

1021 + + + + + + 1080 

CGACTCCTGGGACAACGGCAGTAAGCATGCGTGCAGTTGTCTAACTTTGAAGTCCCTCAA 

A E D P V A V I R T H V N R L K 1 0 G 7 
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Fig.5(Cont3) 

H 
i 

B n 
Pa d 
s m- I 

t H I 

II I 
TAATAGCTGCAGTTACTAGGATCCTCATTACAAGCTT 

1081 +--- -+-- + -- 1117 

ATTATCGACGTCAATGATCCTAGGAGTAATGTTCGAA 

* 



B. DNA sequence of 3 '-end of synthetic lipase gene in pUR6600, 



P 
s 
■ t 
I 

GCTGAGGACCCTGTTGCCGTCATTCGTACGCACGTCAACAGATTGAAACTGCAGGGAGTr 

1021 +- + -f + ± 1080 

CGACTCCTGGGACAAGGGCAGTAAGCATGCGTGCAGTTGTCTAACTTTGACGTCCCTCAA 

AEDPVAVIRTHV'NRLKLQGV 

H 
i 

d 
I 
I 
I 

TGAAGCTT 

1081 1088 

ACTTCGAA 
* 
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Fig. 6 n/U9 

3A 3C 3E 3G 3K 3N 3R 3T 



B 



3B 



3D 3F 3H 3M 3P 3S 3U 



pUR 6028 (264 bp) 

EcoRI Sail Hpal 

<- 3A(25) -X- 3CC33) -X- 

AATTCGTAATGATAGTAAGTCGACGGTTATTGCGGCTTTCGTTAACGTTTTCGGCACAT 

G C ATTACTATC ATT C AG CTG C C AATAACG C C G AAAGC AATTG C AAAAG C CGTGT A 
<- 3B(33) -X- 3D(33) 

Hgal Fspl 
3E(33) -X- 3G(36) 

TGGTTTCTA.GCTCTCACAATACGGATCAGGACGCCCTTGCTGCATTGCGCACGCTTACAA 

ACCAAAGATCGAGAGTGTTATGCCTAGTCCTGCGGGAACGACGTAACGCGTGCGAATGTT 
-X- 3F(36) -x- 

SacI 

-X- 3K(33) -X- 3N(33) 

CGGCTCAGACTGCCACGTATAATAGAAACTTTCCAAGCGCTGGCTTGGGAGCTCCTGGTT 

GCCGAGTCTGACGGTGCATATTATCTTTGAAAGGTTCGCGACCGAACCCTCGAGGACCAA 
3H(36) -X- 3M(33) -X- 

PvuII 

-X- 3R(36) -X- 

CTTGTCAGACGGGCGCAGCTACAGAGACGGTTGGAGGCTCTCAGCACTTGCTTTACAGCT 

GAACAGTCTGCCCGCGTCGATGTCTCTGCCAACCTCCGAGAGTCGTGAACGAAATG7CGA 
3? (33) -X- 3SC36) 

Kpnl HindlXI 
3T<35) -> 

*jW»vjrt\j*j j. inl lrw XAt*f*. 



CCCCTC catggaatgatagtaatgttcga 
-x- 3U(2i) -> 
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x Sad./ Hindlll 

isolate vector fragment 



x EcoRV/HindlH 
isolate fragment 
(1080 bp) 
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Fig. 11 



BglII*<0 
BamHI 
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Fig.12(cont) 
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Fig. 16. 



SU50B SU50C 




-9.0kb 
-4.5kb 
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Fig. 17. 

• c 

• B * Bglll 

• C * Bglll 

•' SU 50 * Bglll 



Fig. 18. 

• c 

• B * Bglll 

• C * Bglll 

• SU 50 * Bglll 
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Fig. 22. 
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Fig. 23. 



rDNA (9.1kb) 
pMIRY7-(JA (6.3 kb) 
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Fig. 2U. 

Stability of SU50B in continuous culture 
medium a D= 0.1 1/h 

uptake & [leu] act & d.w. *10 

r 1 ^00 




fermentation time/ hours 

a gal act mg/l — f- cell d.w. g/l*10 
[leucinelres mg/! -&- leu uptake mg/gX/h 



CI IP OT ITI 1-r-i- ■ — 
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Fig. 25. 
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pUR603S (fig- 27) 
* Eco RI 
X EcoRV 

isolate vector fragment 

invertase signal sequence » « lipase 

-13 

K I S A A G 

A : Eco RI-XhoI AAG ATCTCCG CCG C AGGG Eco RV 

-7 

K I S A A S 

B : ECORI-Xhol .... AAG ATCTCCGCCGCATCT Eco RV 

+ 1 

K I S A A D 
C: Eco RI- Xho I AAGATCTCCGCCGCAGAC EcoRV 

o= 

J[ ligate EcoRI -EcoRV fragments- 



Fig. 26. 



pUR6850 
PUR6851 

pUR6852(fig. 28) 



x Xho l , partial, isolate vector fragment 
<c . = 



pUR3501 (fig- 29) 
x Xho l , partial, 
isolate 1500 bp 
fragment . 



PUR68 60 
PUR68 61 

PUR6862 (fig, 30) 



x Bam HI . Klenow filled in 
x EcoR I 

isolate 2500 bp fragment 



PUR3513 (fig, 33) 
X PvuII 



pUR3511 (fig- 31) 
x Sma l 
x Eco RI 
isolate 
vector fragment 



0 



pUR687 0 

pUR6871 

pUR6872 (fig- 32) 

xEcoRI 
xHindlll 

Klenow filled in 
isolate 3000 bp fragment 



pUR688 0 
pUR6881 

pUR6882 (fig. 34) 
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Fig. 36. 
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